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Introduction: 

The  molecular  basis  of  human  disease  is  rapidly  being  defined.  Genetic  analyses  of  large 
affected  kindreds  have  greatly  facilitated  the  identification  of  a  gene  or  set  of  genes  when  mutated  is 
instrumental  to  the  etiology  of  that  particular  disease.  In  some  instances  the  clinical  phenotype  (i.e.  UV 
sensitivity)  presented  by  the  patient  may  provide  ideas  as  to  which  cellular  pathway(s)  (i.e.  DNA 
repair)  are  affected.  Contrary  to  this  scenario,  many  patients  lack  a  definable  clinical  phenoptype, 
which  can  be  associated  with  a  cellular  pathway,  which  may  be  affected.  Nonetheless,  once  a  new 
disease  gene  is  identified,  definition  of  its  protein’s  function  is  the  immediate  interest  in  trying  to 
understand  why  and  how  naturally  occurring  mutations  in  the  gene  perturb  the  function  of  its  protein 
and  confer  disease  susceptibility. 

As  we  begin  to  enter  the  post-genome  era,  structure/function  studies  of  newly  cloned  gene 
products  will  be  described  by  many  different  paradigms.  The  recurrent  theme  that  has  emerged  from 
the  biochemical/structural  study  of  proteins  involved  in  signal  transduction  and  transcriptional 
regulation  is  that  multiple  independently  folded  globular  domains  in  these  proteins  often  cooperate  in 
macromolecular  recognition.  The  presence  of  highly  conserved  amino  acid  signature  motifs  and  their 
spatial  arrangement  within  a  novel  protein  can  provide  the  first  clue  to  the  protein’s  function. 
Furthermore,  the  identification  of  interacting  proteins  has  been  beneficial  in  defining  the  biochemical 
function  of  a  particular  protein  and/or  the  cellular  pathway  (i.e.  cell  signalling,  transcription,  protein 
degradation)  in  which  it  functions.  Multi-disciplinary  studies  aimed  at  analyzing  a  protein’s  structure, 
biochemical  function,  and  cellular  pathway  have  been  fruitful  in  understanding  why  naturally 
occurring  mutations  in  a  given  protein  confer  susceptibility  to  disease.  These  studies  have  assisted  in 
the  design  and  development  of  better  detection  systems,  disease  diagnosis,  and  innovative  therapeutic 
strategies  for  the  intervention  of  disease.  The  purpose  of  this  post-doctoral  fellowship  was  to  broaden 
my  training  in  molecular  biology/biochemistry,  with  an  emphasis  on  understanding  the  biochemical 
functions  of  a  protein  and  its  physiologic  role  in  the  cell. 

Current  Research: 

The  fundamental  characteristic  of  cells  to  detect  extracellular  stimuli  and  execute  the 
appropriate  response  is  critical  for  Metazoan  development  and  homeostasis.  Upon  receiving  external 
signals,  finely  orchestrated  cascades  transduce  signals  from  the  cell  surface  to  the  nucleus  that 
culminate  in  the  activation  or  repression  of  defined  sets  of  genes  which  leads  to  integrated  cellular 
responses  such  as  proliferation,  differentiation,  and  apoptosis.  While  core  promoter  elements  facilitate 
the  assembly  of  and  orient  the  basal  transcription  machinery  at  the  transcription  start  site,  unique  cis- 
regulatory  elements  upstream  from  the  core  promoter  are  recognized  by  sequence-specific 
transcription  factors.  These  factors  are  frequently  the  penultimate  targets  of  extracellular  signals  and 
influence  the  rate  at  which  a  given  gene  is  transcribed.  My  current  research  interests  have  focused 
on  structural  and  biochemical  mechanisms  of  transcriptional  repression  conferred  by  superfamilies  of 
sequence  specific  transcription  factors. 

Regulation  of  RNA  polymerase  II  involves  a  complex  interplay  between  DNA-protein 
interactions  and  protein-protein  interactions.  While  the  general  transcription  factors  regulate  the 
accurate  initiation  of  transcription,  the  proteins  that  bind  gene-specific  DNA  elements  that  are 
instrumental  in  either  negatively  or  positively  regulating  the  rate  of  transcription.  The  dominant  theme 
that  has  emerged  from  the  study  of  eukaryotic  transcriptional  regulatory  proteins  is  that  they  are  highly 
modular  in  architecture  with  independent,  functionally-separable  domains  mediating  nuclear 
localization,  sequence-specific  DNA  binding,  hetero-  or  homo-oligomerization,  activation,  and 
repression  of  transcription.  Recently,  much  effort  has  been  expended  to  understand  how  activation 
and  repression  domains  transmit  the  signal  for  modulation  of  transcription  from  a  DNA-bound  protein 
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to  the  RNA  synthesis  machinery.  Studies  aimed  at  understanding  the  mechanisms  of  transcription 
repression  have  been  greatly  aided  by  the  realization  that  the  domains,  which  mediate  repression,  are 
often  highly  conserved  amino-acid  sequence  motifs,  linked  to  a  common  DNA  binding  domains. 
Examples  of  these  domains  include  BTB/POZ,  WRPW,  SNAG,  and  KRAB.  We  have  focused  on  the 
Kruppel-associated  Box  (KRAB)  domain  as  a  model  system  for  the  analysis  of  transcriptional 
repression  in  mammalian  cells  (1). 

The  KRAB  (Krttppel  associated  box)  domain  is  one  example  of  an  abundant  amino  acid 
sequence  motif  found  at  the  NH2-terminus  of  nearly  one-third  of  all  Kruppel/TFIIIA  type  C2H2  zinc 
finger  protein  (1).  This  highly  conserved  domain  displays  potent,  DNA-binding  dependent  repression 
of  transcription  that  requires  the  KAP-1  corepressor  (2,3).  The  primary  amino  acid  sequence  of  KAP-1 
revealed  the  presence  of  several  conserved  signature  motifs,  including  a  RING  finger,  B  boxes,  a 
coiled-coil  region  (RBCC),  which  collectively  form  an  integrated  domain  that  is  both  necessary  and 
sufficient  to  directly  interact  with  the  KRAB  domain  (4,5).  The  COOH-terminal  sequence  of  KAP-1 
revealed  a  conserved  PHD  finger  and  bromodomain  (6,7).  This  particular  spatial  arrangement  of 
motifs  has  defined  an  emerging  family  of  transcriptional  regulators,  which  include  TIFla  and  TIFly. 
All  members  of  this  protein  family  have  been  shown  to  repress  transcription  when  tethered  to  DNA, 
and  the  mechanisms  by  which  they  repress  transcription  are  currently  being  defined  (8,9). 

To  identify  regions  of  KAP-1  which  are  required  for  its  transcriptional  repression  function  we 
have  engineered  a  series  of  KAP-1  deletion  mutants  with  the  heterologous  GAL4  DNA  binding 
domain  to  grossly  map  repression  domains.  In  general,  we  have  found  that  amino  acids  COOH- 
terminal  to  the  RBCC  region  define  at  least  two  independent  repression  domains  within  KAP-1  that 
appear  to  be  additive  in  nature  in  order  to  obtain  maximal  levels  of  transcriptional  repression.  One  of 
these  domains  includes  the  recently  defined  HP1  binding  domain  of  KAP-1  (10,11).  Optimal 
repression  of  transcription  was  dependent  upon  the  PHD  finger  and  bromodomain.  When 
autonomously  tethered  to  DNA,  these  two  motifs  demonstrated  a  significant  level  of  repression, 
suggesting  that  they  can  function  as  an  independent  repression  domain.  Neither  the  PHD  finger  nor 
the  bromodomain  could  independently  recapitulate  this  repression  activity,  indicating  that  these  two 
motifs  function  cooperatively  to  repress  transcription.  Furthermore,  expression  of  the  segment 
encoding  the  PHD  finger  and  bromodomain  of  KAP-1  relieved  both  GAL4-KRAB  and  GAL4-KAP-1 
directed  repression,  suggesting  that  repression  by  this  bipartite  domain  is  mediated  by  at  least  one  or 
more  titratable  effector  molecules.  The  combination  of  these  data  suggests  that  the  PHD  finger  and 
bromodomain  of  KAP-1  function  as  an  integrated  unit  to  facilitate  repression  of  transcription  by  KAP- 
1.  Moreover,  these  data  implicate  an  additional  repression  mechanism  that  is  independent  of  the 
recruitment  of  HP1  proteins. 

Because  the  KAP-1  orthologues,  TIFla  and  TIFly,  have  been  reported  to  function  as 
transcriptional  repressors,  we  compared  the  repression  properties  of  the  PHD  finger  and  bromodomain 
of  all  three  family  members.  To  directly  test  this  hypothesis,  we  constructed  heterologous  fusions 
between  the  GAL4  DNA  binding  domain  and  the  PHD  finger  and  bromodomains  of  TIFla,  TIFly,  and 
WCRF180.  In  addition,  we  engineered  chimeric  proteins  in  which  either  the  PHD  finger  or 
bromodomain  of  TIFla,  TIFly,  hATRX,  Mi-2a,  WCRF180,  or  hGCN5  were  substituted  for  the  KAP- 
1  sequences  in  order  to  address  the  specificity  of  the  repression  activity  displayed  by  this  bipartite 
domain.  When  tethered  to  DNA,  the  PHD  finger  and  bromodomain  of  TIFla  demonstrated  a  potential 
to  activate  transcription  of  the  reporter  template,  while  the  PHD  finger  and  bromodomain  of  TIFly  and 
WCRF180  each  failed  to  demonstrate  any  appreciable  potentiation  of  transcription.  Heterologous 
substitution  of  the  KAP-1  PHD  finger  or  bromodomain  with  similar  sequences  from  other 
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transcriptional  regulators  failed  to  completely  complement  the  repression  activity  of  the  wild-type 
KAP-1  PHD  finger  and  bromodomain.  The  combination  of  these  data  provides  additional  support  that 
the  PHD  finger  and  bromodomain  of  KAP-1  function  as  an  integrated  functional  unit  to  specifically 
repress  transcription.  The  data  also  imply  that  not  all  PHD  fingers  and  bromodomains  are  functionally 
equivalent,  despite  demonstrating  homology  in  the  primary  amino  acid  sequence.  Furthermore,  the 
spatial  conservation  of  these  two  motifs  architecturally  in  proteins  is  not  sufficient  to  infer  a  common 
function  for  this  bipartite  domain. 

In  order  to  correlate  the  molecular  relationship  between  amino  acids  in  the  PHD  finger  and 
bromodomain  and  transcriptional  repression,  we  have  used  a  site-directed  mutagenesis  approach.  All 
amino  acid  substitutions  were  made  in  the  context  of  the  heterologous  GAL4-PHD 
finger/bromodomain  fusion  protein.  The  effect  of  each  mutation  on  the  repression  properties  of  this 
bipartite  domain  was  determined  in  DNA-template  based  assays.  As  an  initial  strategy,  we  have 
targeted  amino  acid  residues  in  the  PHD  finger  of  KAP-1  which  spatially  parallel  the  position  of 
naturally  occurring  mutations  in  hATRX  that  confer  an  inherited  susceptibility  to  developmental  defect 
(12).  Second,  we  targeted  amino  acid  residues  that  are  strictly  conserved  among  PHD  fingers  most 
closely  related  to  KAP-1.  The  final  scheme  focused  on  the  PHD  finger  involved  mutating  amino  acids 
in  the  variable  regions  of  the  KAP-1  PHD  finger  to  residues  present  in  the  PHD  fingers  of 
TIFla/TIFly.  In  general,  mutation  of  any  of  the  putative  zinc-chelating  cysteine  residues  and  several 
highly,  conserved  hydrophobic  amino  acids  significantly  disrupted  the  repression  activity  of  this 
bipartite  domain  (Schultz  et  al,  submitted,  Capili  et  al,  submitted). 

To  gain  additional  insights  into  the  biochemical  function  of  the  PHD  finger,  we  expressed  the 
minimal  KAP-1  PHD  (amino  acids  618-679)  in  Escherichia  coli  and  purified  the  protein  to 
homogeneity  using  affinity  and  size  exclusion  chromatographies.  A  combination  of  inductively 
coupled  plasma  (ICP)  spectrometry,  circular  dichroism,  and  homonuclear/heteronuclear  NMR 
techniques  revealed  that  the  KAP-1  PHD  finger  binds  zinc  in  a  cross-brace  topology  between  anti¬ 
parallel  (3-strands  reminiscent  of  RING  domains  (Capili  et  al  submitted).  To  determine  the  effects  that 
mutations  impose  on  the  structural  properties  of  the  PHD  finger,  we  expressed  the  PHD  protein 
containing  mutations  of  three  different  types  (hydrophobic  core,  zinc  chelation,  addition  of  an  extra 
zinc  ligand).  Gel  filtration,  ID  NMR  and  CD  studies  on  all  three  mutations  indicated  that,  as  expected, 
disruption  of  the  hydrophobic  core  or  of  zinc  binding  resulted  in  unfolded  proteins.  'H  NMR  studies 
indicate  that  a-carbon  resonances  corresponding  to  secondary  structural  elements  present  in  the 
wildtype  proteins  are  absent  in  the  mutants.  In  addition,  there  is  poor  dispersion  of  the  resonances  in 
the  amide  region  of  the  spectra  of  all  three  mutants  indicative  of  an  unfolded  protein.  Further,  there  are 
few  NOEs  observed  in  2D  'H  NOESY  experiments  again  demonstrating  that  these  proteins  are  largely 
unstructured.  Thus,  the  ability  of  the  KAP-1  PHD  domain  to  bind  zinc  and  maintain  the  hydrophobic 
core  are  both  required  for  its  proper  folding  and  ability  to  repress  transcription. 

We  also  used  the  information  provided  by  the  solution  structure  of  the  pCAF  bromodomain  to 
design  six  independent  mutations  in  each  of  the  four  a-helices  and  one  mutation  in  the  variable  loop 
region  between  helix  Z  and  helix  A  of  the  KAP-1  bromodomain  (13).  Mutations  in  both  helix  B  and 
helix  C  significantly  impaired  transcriptional  repression.  When  these  mutations  were  mapped  back 
onto  the  3D  structure,  each  is  presumed  to  effect  amino  acids  that  contribute  to  the  hydrophobic  core 
and  the  stability  of  the  structure.  Interestingly,  the  mutation  in  the  variable  loop  region  between  helix 
Z  and  helix  A  also  displayed  an  intermediate  effect  on  the  repression  potential  of  this  domain, 
reaffirming  the  potential  role  for  this  region  in  macromolecular  recognition.  Overall,  the  data  from  this 
mutational  analysis  is  consistent  with  the  PHD  finger  and  bromodomain  of  KAP-1  functioning 
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cooperatively,  as  independent  mutations  in  either  domain  can  significantly  disrupt  the  intrinsic 
repression  potential  of  this  bipartite  domain.  One  interpretation  of  this  data  is  that  these  two  motifs 
provide  an  interface  for  protein-protein  interactions  with  downstream  effectors  of  transcriptional 
repression. 

To  identify  potential  mechanisms  of  transcriptional  repression  directed  by  the  KAP-1  PHD 
finger  and  bromodomain,  a  heterologous  fusion  between  the  LEXA  DNA  binding  domain  and  the 
PHD  finger  and  bromodomain  of  KAP-1  was  used  in  a  yeast  two-hybrid  screen.  To  confirm  the 
specificity  of  any  potential  protein  interactions  identified  in  this  screen,  we  utilized  several  of  the 
amino  acid  substitutions  engineered  for  the  evaluation  of  the  structure/function  analysis  of  this 
bipartite  domain.  Four  different  gene  products  were  recovered  based  on  coactivation  of  both  the 
integrated  LEXA  responsive  His3  and  LacZ  reporter  genes.  The  nucleotide  sequence  of  three, 
independent  transformants  of  one  gene  product  indicated  that  the  sequence  was  identical  to  the  3’ 
nucleotide  sequence  of  the  dermatomyositis-specific  autoantigen,  Mi-2a/CHD3  (14).  Upon 
reintroduction  into  yeast,  this  GAL4  activation  domain  fusion  protein  displayed  robust  ability  to 
activate  the  LacZ  reporter  with  the  wild-type  LEXA-KAP-1  fusion  protein,  but  failed  to  activate  the 
reporter  with  other  irrelevant  test  baits.  Two  PHD  finger  mutants  that  demonstrated  nearly  wild-type 
levels  of  transcriptional  repression  activity  also  displayed  the  ability  to  activate  the  LacZ  reporter.  In 
contrast,  zinc  chelating  or  hydrophobic  core  mutations  in  the  PHD  finger,  which  possess  impaired 
repression  activity,  either  failed  to  activate  these  reporters  or  activated  with  reduced  efficiency, 
respectively.  This  putative  protein-protein  interaction  was  also  observed  to  be  dependent  on  a 
functional  bromodomain.  These  observations  are  completely  consistent  with  the  transcriptional  effects 
observed  in  transient  transfection  assays  in  which  the  repression  activity  was  dependent  upon  both 
domains.  Furthermore,  mutations  in  either  the  PHD  finger  or  the  bromodomain  were  independently 
sufficient  to  ablate  the  repression  function  of  this  bipartite  domain.  The  combination  of  these  data 
strengthens  the  argument  that  the  PHD  finger  and  bromodomain  of  KAP-1  function  as  an  integrated 
functional  unit,  which  provides  an  interface  for  protein-protein  interactions  with  molecules  that 
facilitate  repression  by  the  KRAB:KAP-1  complex. 

The  Mi-2  family  of  proteins  has  been  described  as  an  integral  component  of  a  high  molecular 
weight  multiprotein  complex  containing  histone  deacetylase  activity  (15,16).  To  verify  an  interaction 
in  vivo  between  KAP-1  and  Mi-2a  we  used  coimmunoprecipitation  experiments  of  endogenous 
proteins  from  phosphocellulose  fractionated  HeLa  nuclear  extracts.  Three  independent  antibodies 
against  nonoverlapping  antigens  of  KAP-1  specifically  coprecipitated  Mi-2a,  HDAC1,  and  RbAp48, 
three  constitutive  polypeptides  of  the  NuRD  complex.  This  result  demonstrates  an  association  between 
KAP-1  and  Mi-2a  in  vivo.  To  determine  the  specificity  of  this  interaction  in  vivo,  we  transiently 
transfected  293  cells  with  Flag-tagged  wild-type  KAP-1  or  a  Flag-tagged  delta  PHD/bromodomain 
construct.  Coimmunoprecipitation  experiments  revealed  the  specific  precipitation  of  endogenous  Mi- 
206  with  the  wild-type  KAP-1  protein,  but  not  with  the  delta  PHD/bromodomain  KAP-1  protein.  This 
data  demonstrates  that  the  association  of  KAP-1  with  Mi-2a  specifically  requires  sequences  in  the 
PHD  finger  and  bromodomain.  In  separate  experiments,  overexpression  of  the  C-terminal  amino  acids 
of  Mi-2  alpha  can  dominantly  inhibit  KRAB: KAP-1 -mediated  repression  in  vivo.  Since  Mi-2  has  been 
biochemically  purified  as  an  integral  component  of  a  core  histone  deacetylase  complex,  these  studies 
suggest  that  the  KRAB: KAP-1  repression  complex  assists  in  targeting  the  biochemical  activities  of 
NuRD  to  specific  promoters  in  vivo.  Consistent  with  this  hypothesis,  addition  of  trichostatin  A  to 
transient  transfection  assays  revealed  that  the  transcriptional  repression  exerted  by  the  PHD  finger  and 
bromodomain  of  KAP-1  is  partially  reversible,  indicating  the  potential  involvement  of  histone 
deacetylases  in  KAP-1  mediated  repression  (Schultz  et  al.,  submitted).  These  data  provide  evidence 
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that  the  KRAB-ZFP:KAP-1  repressor xorepressor  complex  functions  to  target  histone  deacetylases  to 
KRAB-zfp  target  genes  in  vivo. 


Future  Directions: 

The  goal  of  this  research  fellowship  was  to  further  broaden  my  training  in  molecular 
biology/biochemistry  with  a  particular  focus  on  understanding  the  biochemical  functions  of  novel  gene 
products,  which  may  have  a  role  in  the  etiology  of  human  disease.  The  findings  of  the  current  research 
have  greatly  fostered  the  development  of  new  hypotheses  to  test  in  the  future.  As  I  prepare  to  establish 
an  independent  laboratory,  the  focus  of  my  research  will  continue  to  use  genetic  and  biochemical 
approaches  to  ask  questions  about  the  molecular  relationship  between  the  biochemical  functions  of  a 
protein  and  its  physiologic  role  in  the  cell.  In  particular,  I  am  interested  in  structural  and  biochemical 
mechanisms  of  transcriptional  repression  conferred  by  superfamilies  of  mammalian  sequence 
specific  transcription  factors.  The  long-range  goal  of  this  work  is  to  understand  how  targeted 
recruitment  of  repressor:corepressor  molecules  facilitate  transcriptional  repression  of  chromatin 
templates.  The  rationale  for  this  research  is  that  once  the  mechanics  of  transcriptional  repression  of 
gene  promoters  at  the  chromatin  level  is  known,  new  and  innovative  approaches,  both  genetic  and 
pharmocologic,  can  be  designed  to  reverse  aberrant,  epigenetic  silencing  of  genes  involved  in  a  variety 
of  diseases.  This  direction  is  particularly  relevant  to  breast  cancer,  as  several  genes  have  been  shown 
to  be  epigenetically  silenced  (i.e.  E-cadherin)  during  mammary  tumorigenesis.  Furthermore,  a  recent 
report  has  described  the  association  of  a  novel  KRAB-ZFP  with  BRCA1  (17).  This  association  implies 
a  role  for  KAP-1  in  BRCAlmediated  transcriptional  repression. 

Our  studies,  which  represent  the  first  structural  characterization  of  a  PHD  domain,  will  have 
widespread  implications,  as  more  than  400  independent  proteins  have  been  described  to  possess  a  PHD 
finger.  Moreover,  our  structural  analysis  help  explain  the  consequences  of  site-specific  mutations  in 
the  PHD  finger  of  the  hATRX,  AIRE1,  and  ING1  proteins  which  predispose  individuals  to 
developmental  defects  and  cancer.  The  coordination  of  zinc  in  a  cross-brace  topology  between  anti¬ 
parallel  |3-strands  reminiscent  of  RING  domains  invokes  the  possibility  that  the  PHD  finger  may  have 
RING-like  biological  activities.  Recent  studies  have  shown  that  some  RING  domains  function  as  E3 
ubiquitin  ligases,  components  of  the  ubiquitin  degradation  pathway.  Because  of  the  similarity  of  the 
PHD  finger  to  the  RING,  it  is  reasonable  to  hypothesize  that  the  PHD  finger  may  also  function  as  an 
E3  for  ubiquitin  or  any  other  ubiquitin-like  molecule  We  have  initiated  experiments  to  test  whether  the 
RING  finger  and  PHD  finger  of  KAP-1  or  any  other  members  of  the  TIF1  family  of  transcriptional 
regulators  possess  this  E3  activity.  Such  studies  may  provide  information  as  to  how  this  unique  family 
of  proteins  functions  to  regulate  transcription,  and  moreover,  may  provide  insights  into  how  these 
proteins  themselves  are  regulated.  As  an  extension  of  our  structural  studies,  we  are  attempting  to  solve 
the  crystal  structure  of  a  PHD  finger/bromodomain  protein.  Since  our  results  indicate  that  the  PHD 
finger  and  bromodomain  of  KAP-1  likely  function  as  an  integrated  unit,  these  results  are  expected 
provide  new  insights  as  to  how  this  bipartite  domain  functions  in  transcriptional  repression. 


Second,  I  am  interested  in  understanding  the  mechanics  of  transcriptional  silencing  of  genes  at 
single  copy  genomic  loci.  We  are  particularly  well  prepared  to  undertake  the  proposed  research, 
because  we  have  extensively  defined  the  biochemical  mechanisms  of  transcriptional  repression 
utilized  by  the  model  KRAB:KAP-1  repressor: corepressor  system,  including  heterochromatinization, 
histone  deacetylases,  and  histone  methylases.  From  these  data,  we  postulate  that  transcriptional 
repression  in  vivo  results  from  the  ordered  progression  of  independent  molecular  events.  We  will  use 
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the  well-defined  KRAB:KAP-1  repression  system  to  dissect  the  spatial  and  temporal  order  of  these 
events  at  a  heterologous-targeted  transgene  locus.  Chromatin  immunoprecipitation  and  micrococcal 
nuclease  digestion  experiments  will  be  used  to  monitor  the  physical  properties  of  the  transcriptionally 
silent  chromatin  locus.  Previously  defined  dominant  mutations  in  the  KAP-1  corepressor  which  affect 
protein  interactions  with  effectors  of  repression  will  be  useful  in  dissecting  the  role  a  given  mechanism 
has  in  the  efficiency  of  gene  silencing  by  KAP-1.  The  results  from  these  studies  can  be  used  as  a 
framework  to  test  gene  silencing  of  endogenous  KRAB-ZFP  target  genes.  In  particular,  we  will  focus 
our  efforts  on  ZBRK1,  a  BRCA1  associated  KRAB-ZFP,  and  its  role  in  the  regulation  of  the  GADD45 
gene  (17). 

The  proposed  research  for  the  final  year  of  this  fellowship  is  innovative,  because  its  applies  the 
well  characterized  KRAB:  KAP-1  repression  system  to  evaluate  the  function  of  the  PHD  finger,  an 
abundant  amino  acid  motif  in  putative  transcriptional  regulatory  proteins,  and  specific  effects  of 
targeted  transcriptional  silencing  at  a  single  copy  genomic  loci.  It  is  our  expectation  that  the  proposed 
research  will  define  molecular  events  within  promoter  elements  that  occur  during  transcriptional 
silencing.  These  results  will  be  significant,  because  the  knowledge  of  transcriptional  gene  silencing 
mechanics  will  provide  new  targets  for  therapeutic  intervention  of  de  novo  silenced  genes  in  cancer.  In 
addition,  it  is  expected  that  the  results  of  this  proposal  will  fundamentally  advance  the  field  of  covalent 
histone  modifications  and  the  biochemistry  of  transcriptional  repression  by  the  KRAB  zinc  finger 
protein  superfamily. 
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Key  research  accomplishments 

•  Described  an  autonomous  DNA-binding  dependent  repression  activity  conferred  by  the  PHD  finger 
and  bromodomain  of  KAP-1. 

•  Confirmed  an  in  vivo  association  between  the  endogenous  corepressor  KAP-1  and  the  Mi-2a 
subunit  of  the  histone  deacetylase  complex,  NuRD. 

•  Demonstrated  that  the  in  vivo  association  between  KAP-1  and  Mi-2a  requires  the  PHD  finger  and 
bromodomain  of  KAP-1 . 

•  Solved  the  first  solution  structure  of  the  KAP-1  PHD  finger. 

•  Using  the  KAP-1  PHD  finger  structure,  characterized  the  chemical  and  physical  effect  of  spatially 
conserved  disease  relevant  mutations. 


Reportable  Outcomes: 

Publications 

Peng  H,  Begg  GE,  Schultz  DC,  Friedman  JR,  Jensen  DE,  Speicher  DW,  Rauscher  III  FJ. 
Reconstitution  of  the  KRAB -KAP-1  Repressor  Complex:  A  Model  System  for  Defining  the  Molecular 
Anatomy  of  RING-B-box-Coiled-Coil  Domain-Mediated  Protein-Protein  Interactions.  J  Mol.  Bio.  5: 
1139-1162  (2000). 


Manuscripts 

Schultz  DC,  Friedman  JR,  Rauscher  III  FJ  Targeting  Histone  Deacetylase  Complexes  via  KRAB-Zinc 
Finger  Proteins:  The  PHD  and  Bromodomains  of  KAP-1  form  a  Cooperative  Repression  Domain  that 
Recruits  the  Mi-2a  Subunit  of  the  NuRD  complex.  (Submitted,  Genes  and  Development,  2000) 

Capili  AD,  Schultz  DC,  Rauscher  III  FJ,  and  Borden  KLBSolution  structure  of  the  PHD  domain 
from  the  KAP-1  corepressor:  Structural  determinants  for  PHD,  RING,  and  LIM  zinc-binding  domains. 
(Submitted,  EMBO,  2000) 

Abstracts 

*Schultz  DC,  Jensen  DE,  Peng  H,  Rauscher  FJ,  III.  The  BRCA1  and  KAP-1  RING  fingers:  model 
domains  for  studying  the  function  of  transcriptional  coactivator  and  corepressor  proteins.  The  Era  of 
Hope  Breast  Cancer  Meeting,  June  9,  2000. 

Schultz  DC,  Capili  AD,  Borden  KLB,  Rauscher  III  FJ.  The  KRAB-ZFP:KAP-1  Repression  System 
Recruits  the  Histone  Deacetylase  Complex,  NuRD,  via  an  Interaction  Between  the  PHD  and 
Bromodomains  of  KAP1  and  the  Mi-2  Subunit.  (Submitted  Kestone  Symposia-C5,  2001) 


•  Indicates  platform  presentation  given  by  David  C.  Schultz 
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Conclusions: 

The  support  fellowship  has  provided  the  fundamental  training  in  both  molecular  biology  and 
biochemistry  techniques.  The  training  has  allowed  new  hypotheses  to  be  formulated,  based  on 
completed  studies  and  current  work,  to  extend  this  research  in  the  next  logical  direction.  Moreover  the 
this  fellowship  training  has  provided  the  intellectual  framework  for  the  design  of  new  research  projects 
which  address  important  questions  pertaining  to  breast  cancer. 
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THE  BRCA1  AND  KAP-1  RING  FINGERS:  MODEL  DOMAINS  FOR 
STUDYING  THE  FUNCTIONS  OF  TRANSCRIPTIONAL  COACTIVATOR  AND 

COREPRESSOR  PROTEINS 


David  C.  Schultz,  David  E.  Jensen,  Hongzhuang  Peng,  and 
Frank  J.  Rauscher,  III 

The  Wistar  Institute,  3601  Spruce  Street,  Philadelphia,  PA  19104 
Email:  dshultz@wistar.upenn.edu 

The  recurrent  theme  that  has  emerged  from  the  structural  study  of  proteins 
involved  in  signal  transduction  and  transcriptional  regulation  is  that  multiple 
independently  folded  globular  domains  in  these  proteins  often  cooperate  in 
macromolecular  recognition.  The  presence  of  highly  conserved  signature  amino  acid 
sequence  motifs  and  their  spatial  organization  within  a  novel  protein  is  often  the  first  clue 
to  its  biochemical  function.  Furthermore,  the  identification  of  interacting  proteins  has 
been  beneficial  in  defining  the  biochemical  function  of  a  particular  protein  and/or  the 
cellular  pathway  (i.e.  cell  signaling,  transcription,  DNA  repair,  protein  degradation)  in 
which  it  functions.  We  have  initially  used  the  RING  finger  motif  as  a  model  domain  for 
studying  protein  function.  The  RING  finger  motif  is  a  very  abundant  sequence  motif  in 
the  protein  database.  Upon  cloning  of  BRCA1,  the  most  recognizable  signature  motif  in 
the  amino  acid  sequence  was  a  RING  finger  at  the  N-terminus.  Moreover,  genetic 
analysis  of  breast  cancer  kindred’s  identified  naturally  occurring  germline  mutations  that 
lead  to  single  amino  acid  substitutions  of  highly  conserved  residues  in  the  RING  finger 
and  predispose  individuals  to  disease.  We  have  identified  the  novel  protein,  BAP-1,  that 
specifically  interacts  with  the  RING  finger  of  BRCA1,  but  not  naturally  occurring  RING 
mutants.  These  initial  studies  have  expanded  to  understanding  the  function  of  the  RING 
finger  motif  in  the  context  of  a  tripartite  RBCC  domain  (RING-B-box-coiled-coil),  a 
highly  conserved  spatial  arrangement  of  signature  motifs  found  in  a  number  of 
oncoproteins,  including  PML1  and  MIDI.  We  have  found  that  each  sub-domain  of  this 
tripartite  motif  of  KAP-1,  a  transcriptional  corepressor  protein,  is  essential  for  interaction 
with  its  ligand,  and  each  of  these  structural  elements  appears  to  be  vital  for  proper  protein 
oligomerization.  These  studies  will  have  widespread  implications  in  future  studies  aimed 
at  determining  the  ligands  for  RBCC  containing  proteins.  The  PHD  finger  motif  is  a 
second  autonomous  signature  motif,  which  significantly  resembles  the  spatial 
arrangement  of  the  RING  finger.  Similar  to  the  RING  finger,  naturally  occurring 
mutations  in  the  PHD  finger  of  the  hATRX  protein  confer  a  susceptibility  to 
developmental  defects.  In  this  regard,  synthetically  derived  mutations  in  the  PHD  finger 
of  KAP-1  disrupt  its  ability  to  optimally  repress  transcription.  Ultimately,  the  detailed 
analysis  of  a  protein’s  structure,  biochemical  function,  or  cellular  pathway  may  assist  in 
the  design  and  development  of  better  therapeutic  strategies  for  the  intervention  of  disease. 

The  U.  S.  Army  Medical  Research  and  Material  Command  under  DAMD  17-98-1-8269 
supported  this  work. 


The  KRAB-ZFP:KAP-1  Repression  System 
Recruits  the  Histone  Deacetylase  Complex,  NuRD,  via  an 
Interaction  Between  the  PHD  and  Bromodomains  of  KAP1 
and  the  Mi-2  Subunit. 

David  C.  Schultz1,  Allan  D.  Capili2,  Katherine  L.  B.  Borden2, 
and  Frank  J.  Rauscher  III1.  lThe  Wistar  Institute,  Philadelphia 
PA.  2Mount  Sinai  School  of  Medicine,  New  York,  NY. 

The  PHD  finger  and  bromodomain  are  two  highly  conserved 
protein  motifs  found  in  proteins  with  transcriptional  regulatory 
functions.  Naturally  occurring  mutations  in  the  PHD  finger 
contribute  to  a  variety  of  human  diseases  including  ATRX 
syndrome,  myeloid  leukemias,  and  autoimmune  dysfunction. 
Optimal  transcriptional  repression  by  KAP-1,  a  universal 
corepressor  for  the  KRAB-ZFP  superfamily  of  transcriptional 
repressors,  requires  an  intact  PHD  finger  and  bromodomain  at 
its  COOH-terminus.  Site-directed  mutations  in  the  KAP-1 
PHD  finger  or  the  bromodomain  specifically  disrupt  the 
repression  function  of  this  bi-partite  domain.  We  have 
determined  a  solution  structure  for  the  PHD  finger  of  KAP-1. 
Our  studies  reveal  that  the  PHD  domain  binds  zinc  in  a  cross¬ 
brace  topology  reminiscent  of  RING  domains. 
Mechanistically,  the  KAP-1  PHD  and  bromodomains  are 
required  to  mediate  an  interaction  with  the  Mi-2a  subunit  of 
the  NuRD  in  vivo  as  determined  by  co-immunoprecipitation. 
These  data  indicate  that  the  PHD  finger  and  bromodomain 
provide  an  interface  for  protein-protein  interactions. 
Moreover,  the  data  suggest  the  KRAB-ZFP: KAP-1  repression 
complex  functions  to  target  the  histone  deacetylase  activity  of 
the  NuRD  complex  to  specific  gene  promoters  in  vivo.  This 
work  was  funded  in  part  by  DAMD  17-98-1-8269  to  DCS. 
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Reconstitution  of  the  KRAB-KAP-1  Repressor 
Complex:  A  Model  System  for  Defining  the  Molecular 
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Protein-protein  Interactions 
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Protein-protein  Interactions 

Hongzhuang  Peng,  Gillian  E.  Begg,  David  C.  Schultz,  Josh  R.  Friedman 
David  E.  Jensen,  David  W.  Speicher  and  Frank  J.  Rauscher  III* 

The  Wistar  Institute ,  3601  The  KRAB  domain  is  a  75  amino  acid  residue  transcriptional  repression 

Spruce  Street,  Philadelphia,  PA  module  commonly  found  in  eukaryotic  zinc-finger  proteins.  KRAB- 
19104,  USA  mediated  gene  silencing  requires  binding  to  the  corepressor  KAP-1.  The 

KRAB:KAP-1  interaction  requires  the  RING-B  box-coiled  coil  (RBCC) 
domain  of  KAP-1,  which  is  a  widely  distributed  motif,  hypothesized  to  be 
a  protein-protein  interface.  Little  is  known  about  RBCC-mediated  ligand 
binding  and  the  role  of  the  individual  sub-domains  in  recognition  and 
specificity.  We  have  addressed  these  issues  by  reconstituting  and  character¬ 
izing  the  KRAB:KAP-1-RBCC  interaction  using  purified  components.  Our 
results  show  that  KRAB  binding  to  KAP-1  is  direct  and  specific,  as  the 
related  RBCC  domains  from  TIFla  and  MIDI  do  not  bind  the  KRAB 
domain.  A  combination  of  gel  filtration,  analytical  ultracentrifugation, 
chemical  cross-linking,  non-denaturing  gel  electrophoresis,  and  site- 
directed  mutagenesis  techniques  has  revealed  that  the  KAP-1 -RBCC  must 
oligomerize  likely  as  a  homo-trimer  in  order  to  bind  the  KRAB  domain. 
The  RING  finger,  B2  box,  and  coiled-coil  region  are  required  for  oligomeri¬ 
zation  of  KAP-l-RBCC  and  KRAB  binding,  as  mutations  in  these  domains 
concomitantly  abolished  these  functions.  KRAB  domain  binding  stabilized 
the  homo-oligomeric  state  of  the  KAP-l-RBCC  as  detected  by  chemical 
cross-linking  and  velocity  sedimentation  studies.  Mutant  KAP-l-RBCC 
molecules  hetero-oligomerize  with  the  wild-type  KAP-1,  but  these  com¬ 
plexes  were  inactive  for  KRAB  binding,  suggesting  a  potential  dominant 
negative  activity.  Substitution  of  the  coiled-coil  region  with  heterologous 
dimerization,  trimerization,  or  tetramerization  domains  failed  to  recapitu¬ 
late  KRAB  domain  binding.  Chimeric  KAP-l-RBCC  proteins  containing 
either  the  RING,  RING-B  box,  or  coiled-coil  regions  from  MIDI  also  failed 
to  bind  the  KRAB  domain.  The  KAP-l-RBCC  mediates  a  highly  specific, 
direct  interaction  with  the  KRAB  domain,  and  it  appears  to  function  as  an 
integrated,  possibly  cooperative  structural  unit  wherein  each  sub-domain 
contributes  to  oligomerization  and/or  ligand  recognition.  These  obser¬ 
vations  provide  the  first  principles  for  RBCC  domain-mediated  protein- 
protein  interaction  and  have  implications  for  identifying  new  ligands  for 
RBCC  domain  proteins. 
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Introduction 

The  recurrent  theme  that  has  emerged  from  the 
structural  study  of  proteins  involved  in  signal 
transduction  and  transcriptional  regulation  is  that 
multiple  independently  folded  globular  domains  in 
these  proteins  often  cooperate  in  macromolecular 
recognition.  These  domains  are  often  recognizable 
by  conserved  signature  amino  acid  sequence  motifs 
and  their  organization  within  a  protein  often  is  the 
first  clue  about  protein  function.  The  bHLH  and 
bZIP  DNA-binding  proteins  have  provided  an 
important  paradigm  for  protein  multimerization 
and  macromolecular  recognition:  each  protein 
encodes  adjacent,  functionally  separable  dimeriza¬ 
tion  and  DNA-binding  regions  (Baxevanis  & 
Vinson,  1993;  Hurst,  1995).  The  dimerization  func¬ 
tion,  which  is  mediated  by  a  coiled-coil  motif,  is 
required  to  correctly  orient  the  basic  regions  for 
interaction  with  DNA.  With  these  principles  in 
mind,  we  have  been  characterizing  the  RING-B 
box-coiled-coil  (RBCC)  family  of  nuclear  regulatory 
proteins,  which  have  been  predicted  to  contain 
functionally  separable  multimerization  and  protein 
recognition  domains  (Borden,  1998). 

The  RBCC  family  of  proteins  contains  the  tripar¬ 
tite  motif  that  most  likely  functions  as  a  coopera¬ 
tive  protein-protein  interaction  motif  (Borden, 
1998)  (Figure  1(a)  and  (b)).  The  definitive  element 
of  the  tripartite  motif  is  the  RING  finger,  which  is 
a  cysteine-rich  motif  of  the  form  C3HC4  and  binds 
two  molecules  of  zinc  in  a  unique  cross-braced  lig¬ 
ation  system  (Barlow  et  al,  1994;  Bellon  et  ah,  1997; 
Borden  et  ah,  1995a).  The  RING  finger  is  found 
almost  exclusively  in  the  NH2-terminal  position  in 
RBCC  proteins  and  is  likely  to  contribute  either 
specificity  and/or  multimerization  properties  to 
the  tripartite  motif.  More  than  100  proteins  encode 
for  a  RING  finger  motif,  and  mutational  analyses 
have  confirmed  the  requirement  for  the  RING  fin¬ 
ger  for  proper  biological  function  (for  reviews,  see 
Borden,  1998;  Saurin  et  ah,  1996;  Boddy  et  ah,  1997; 
Cao  et  ah,  1997).  RING  finger  structures  suggest 
that  a  common  hydrophobic  core  is  formed  as  a 
result  of  zinc-chelation  and  that  the  variable 
sequence  between  the  conserved  ligation  residues 
provides  specificity  for  protein  recognition. 

The  second  signature  motif  of  the  RBCC  domain 
is  the  B-box,  which  is  also  a  cysteine-rich  zinc¬ 
binding  motif  of  the  form  CHC3H2  (Reddy  & 
Etkin,  1991)  (Figure  1(b)).  The  B-box  is  an  indepen¬ 
dently  folded  globular  domain.  Two  B-box  motifs 
are  often  found  immediately  carboxy-terminal  to 
the  RING  finger  in  the  RBCC  domain.  The  NMR 
structure  of  the  XNF7  B-box  showed  that  only  one 
zinc  atom  was  bound  with  the  other  potential  che¬ 
lation  residues  unoccupied  (Borden  et  ah,  1995b). 
XNF7  is  required  for  embryonic  development  via  a 
chromatin-binding  function  that  requires  the  integ¬ 
rity  of  the  B-box  (Bellini  et  ah,  1995). 

The  third  RBCC  signature  motif  is  a  coiled-coil 
domain  that  is  comprised  of  appropriately  spaced 
hydrophobic  residues  predicted  to  form  an 


extended  a-helical  region  (for  a  review,  see  Lupas, 

1996)  (Figure  1(b)).  Multiple  individual  regions 
within  the  predicted  coiled-coil  region  display 
amphipathic  character.  Moreover,  many  of  the  bio¬ 
logical  properties  of  RBCC  proteins  have  been 
shown  to  be  dependent  on  multimerization  via  this 
coiled-coil  region  (Saurin  et  ah,  1996). 

Thus,  RBCC  proteins  are  defined  by  this  tripar¬ 
tite  motif,  which  is  found  in  a  spatially  restricted 
organization  in  each  protein.  Clues  to  the  function 
and/or  potential  ligands  for  RBCC  proteins  have 
come  mainly  from  the  studies  of  their  biological 
properties.  Most  notably,  RBCC  proteins  have  been 
defined  as  potential  proto-oncogenes:  RFP,  TIFla, 
and  PML  (Goddard  et  ah,  1991;  Kakizuka  et  ah, 
1991;  Le  Douarin  et  ah,  1995;  Miki  et  ah,  1991; 
Takahashi  et  ah,  1985).  The  RBCC  domain  of  each 
of  these  proteins  is  fused  to  heterologous  proteins 
(as  a  result  of  chromosomal  translocation)  and 
results  in  the  creation  of  oncogenes.  The  RBCC 
domain  of  PML  protein  is  fused  to  the  retinoic  acid 
receptor  alpha  (RARa)  as  a  result  of  the  t(15:17) 
translocation  (Borden  et  ah,  1996;  Goddard  et  ah, 
1991;  Kakizuka  et  ah,  1991),  resulting  in  aberrant 
regulation  of  RAR  target  genes  and  a  block  in  mye¬ 
locytic  differentiation  (Grignani  et  ah,  1996).  The 
PML-RARa  fusion  homodimerizes  and  heterodi- 
merizes  with  the  wild-type  PML  and  RXR:  both  of 
these  activities  require  the  intact  coiled-coil  region 
of  the  RBCC  domain  (Dyck  et  ah,  1994;  Perez  et  ah, 
1993;  Weis  et  ah,  1994).  Multimerization  appears  to 
sequester  wild-type  RAR/ RXR  receptors  into  dis¬ 
crete  subnuclear  domains  termed  ND10  or  PODs 
(for  a  review,  see  Maul,  1998;  Sternsdorf  et  ah, 

1997) .  The  wild-type  PML  protein  functions  as  a 
growth  suppressor  via  induction  of  apoptosis 
(Borden  et  ah,  1997).  This  activity  requires  both 
localization  to  NDlO/PODs  and  cooperation 
between  the  RING,  B-boxes  and  coiled-coil  motifs. 
Thus,  in  this  biological  context,  the  PML-RBCC 
domain  functions  as  a  cooperative  unit  that 
presumably  mediates  specific  protein-protein 
interactions. 

The  Kruppel  associated  box  (KRAB)  domain  was 
originally  identified  as  a  conserved  motif  at  the 
NH2  terminus  of  zinc-finger  proteins  (ZFPs) 
(Bellefroid  et  ah,  1991)  and  was  then  shown  to  be  a 
potent,  DNA-binding-dependent  transcriptional 
repression  module  (Margolin  et  ah,  1994;  Pengue 
et  ah,  1995;  Vissing  et  ah,  1995;  Witzgall  et  ah, 
1994).  The  KRAB  domain  homology  consists  of 
approximately  75  amino  acid  residues  and  is  pre¬ 
dicted  to  fold  into  two  amphipathic  helices 
(Figure  1(c)).  The  minimal  repression  module  is 
approximately  45  amino  acid  residues,  and  substi¬ 
tutions  for  conserved  residues  abolish  repression 
(Margolin  et  ah,  1994).  More  than  ten  indepen¬ 
dently  encoded  KRAB  domains  have  been  demon¬ 
strated  to  be  potent  repressors,  suggesting  that  this 
activity  is  a  common  property.  Our  laboratory 
purified  and  cloned  KAP-1  as  a  protein  that  binds 
to  the  KRAB  repression  domain  (Friedman  et  ah, 
1996).  Structurally,  KAP-1  possesses  all  the 
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signature  motifs  for  an  emerging  superfamily  of 
transcriptional  regulators,  including  TIFla  and 
TIFly  (Friedman  et  al,  1996;  Le  Douarin  et  ah, 
1995;  Venturini  et  al,  1999)  (Figure  1(a)).  Further 
studies  indicated  that  KAP-1  functions  as  a  univer¬ 
sal  corepressor  for  KRAB  domain  proteins 
(Friedman  et  al,  1996).  KAP-1  is  a  97  kDa  nuclear 
phosphoprotein  that  possesses  a  canonical  RBCC 
finger  in  the  NH2  terminus,  and  appears  to  be  both 
necessary  and  sufficient  for  the  KRAB  domain 
binding.  The  carboxy  terminus  of  KAP-1  includes  a 
PHD  finger  and  a  bromodomain,  and  this  region 
functions  as  a  repressor  when  tethered  to  DNA 
using  a  heterologous  DNA-binding  domain,  and 
mutations  in  either  domain  weaken  this  repression 
activity  (unpublished  results). 

It  has  been  firmly  established  that  KAP-1  is 
required  for  KRAB  domain-mediated  repression. 
The  evidence  includes:  (1)  KAP-1  binds  to  multiple 
KRAB  repression  domains  both  in  vitro  and  in  vivo; 
(2)  KRAB  domain  mutations  that  abolish  repres¬ 
sion  decrease  or  eliminate  KAP-1  binding;  (3)  over¬ 
expression  of  KAP-1  enhances  KRAB-mediated 
repression;  (4)  the  KRAB  domain  does  not  repress 
in  cells  that  lack  KAP-1.  These  results  support  a 
model  in  which  KRAB-ZFPs  bind  a  gene  and 
repress  transcription  of  the  gene  by  recruiting  the 
KAP-1  corepressor  via  the  RBCC  domain.  Thus,  the 
KRAB-KAP-1  interaction  serves  as  a  specific 
example  from  which  to  dissect  molecular  determi¬ 
nants  for  RBCC-ligand  interactions.  We  have 
reconstituted  the  KRAB-KAP-1  complex  using  pur¬ 
ified  components  and  have  employed  a  compre¬ 
hensive  set  of  biochemical  approaches  to  analyze 
this  interaction.  Our  data  indicate  that  KAP-1- 
RBCC  oligomerization  is  required  for  KRAB 
domain  binding  and  that  all  three  components  of 
the  tripartite  motif  cooperate  in  KRAB  recognition. 

Results 

Direct  interaction  between  the  KAP-1 -RBCC 
domain  and  the  KOX1-KRAB  domain 

Previous  studies  using  NH2-terminal  truncated 
KAP-1  proteins  have  suggested  that  the  RING  fin¬ 
ger,  B  boxes,  and  coiled-coil  structure  are  necessary 
for  binding  to  the  KRAB  domain  (Friedman  et  al, 
1996).  However,  all  of  these  studies  were  per¬ 
formed  using  indirect  protein-protein  interaction 
assays  either  in  complex  mixtures  in  vitro,  or  in 
yeast  two-hybrid  assays.  We  sought  to  reconstitute 
the  KRAB-KAP-1  interaction  using  purified  recom¬ 
binant  components.  The  proteins  were  overex¬ 
pressed  using  bacterial  or  baculovirus  systems  (see 
below)  and  purified  using  Ni-NTA  chromatog¬ 
raphy  to  near-homogeneity  (Figure  2(a)).  The  KAP- 
1-RBCC  proteins  produced  in  Escherichia  coli  were 
soluble  and  were  purified  under  non-denaturing 
conditions.  SDS-PAGE  analysis  (Figure  2(a)) 
revealed  that  the  KAP-1 -RBCC  protein  migrates 
with  an  apparent  molecular  mass  of  46  kDa,  a 
value  that  is  almost  identical  with  its  predicted 


molecular  mass  of  45.9  kDa.  The  KAP-1 -RBCC 
mutants  show  similar  degrees  of  purity  and 
migrate  identically  in  SDS-PAGE  (Figure  2(a)). 

The  KRAB  domain  that  we  have  utilized  in  these 
studies  corresponds  to  amino  acid  residues  1-90  of 
the  KOX1  zinc  finger  protein  (Figure  1(c)). 
Although  the  KRAB  domain  is  found  in  more  than 
100  independent  proteins,  we  selected  the  KRAB 
domain  from  KOX1  for  the  following  reasons:  (1) 
the  KOX1-KRAB  domain  was  originally  utilized  to 
isolate  the  KAP-1  corepressor,  and  mutations  in 
this  domain  that  concomitantly  abolish  repression 
and  KAP-1  binding  are  well  characterized 
(Friedman  et  al,  1996).  (2)  The  KOX1-KRAB 
domain  is  highly  expressed  in  E.  coli  autonomously 
or  as  a  GAL4  fusion  and  is  well  behaved  in  protein 
reconstitution  assays.  (3)  The  GAL4-KOX1-KRAB 
1-90  fusion  (designated  GAL4-KRAB)  is  a  potent, 
KAP-1  -dependent  and  DNA  binding-dependent 
transcriptional  repressor  in  vivo  (Margolin  et  al, 
1994).  The  wild-type  GAL4-KRAB  and  a  mutant 
version  (DV18,19AA,  which  lacks  repression 
activity)  were  expressed  and  purified  from  E.  coli 
(Figure  2(a)).  To  detect  whether  the  purified  KAP- 
1-RBCC  protein  was  able  to  form  a  stable  complex 
with  a  DNA-bound  KRAB  domain,  we  used  the 
electrophoretic  mobility  shift-supershift  assay 
(EMSA).  Binding  of  the  GAL4-KRAB  protein  to  a 
canonical  32P-labeled  synthetic  oligonucleotide  con¬ 
taining  the  GAL4  recognition  sequence  yielded  an 
expected  mobility  shift  (Figure  2(b),  lane  1).  When 
increasing  amounts  of  purified  KAP-l-RBCC  pro¬ 
tein  were  incubated  with  the  GAL4-KRAB  protein 
and  DNA,  a  new  mobility  shift  was  observed 
(Figure  2(b),  lanes  2-6).  This  supershift  contains 
the  ternary  complex  of  DNA:GAL4-KRAB:KAP-1- 
RBCC,  as  characterized  by  antibody  inhibition/ 
supershift  and  competition  with  unlabeled  compe¬ 
titor  DNA  (data  not  shown).  Formation  of  this 
complex  requires  the  GAL4-KRAB  DNA  binding 
subunit  since  no  shift  is  seen  with  KAP-l-RBCC 
alone  (data  not  shown).  The  GAL4-KRAB(DV-AA) 
protein,  which  shows  severely  impaired  repression 
function  in  vivo  readily  bound  to  2P-labeled  GAL4 
DNA  but  failed  to  generate  the  ternary  complex  in 
the  presence  of  increasing  amounts  of  KAP-l- 
RBCC  protein  (Figure  2(b),  lanes  7-12),  in  agree¬ 
ment  with  previous  reports  (Friedman  et  al,  1996). 
These  data  strongly  suggest  that  the  KAP-l-KRAB 
interaction  is  direct  and  highly  specific,  and  that 
the  RBCC  region  is  necessary  and  sufficient  for  this 
interaction. 

The  RING  finger,  B  boxes,  and  the  coiled-coil 
region  are  critical  for  the  formation  of  the 
KRAB: KAP-1  -RBCC  complex 

To  define  the  sub-domains  of  the  KAP-l-RBCC 
domain  integral  to  its  interaction  with  the  KRAB 
domain,  we  made  independent  mutations  in  the 
RING  finger,  B  boxes,  and  the  coiled-coil  region 
(Figure  1(d)).  Mutant  proteins  were  expressed  and 
purified  in  E.  coli  (Figure  2(a)),  and  analyzed  for 
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Figure  1  (legend  opposite) 
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Figure  1.  A  diagram  illustrating 
the  architecture  of  the  KAP-1/TIF1 
family  of  transcriptional  regulatory 
proteins  and  the  KOX1/KRAB-ZFP 
protein,  (a)  A  representation  of  pro¬ 
teins  containing  the  tripartite  RBCC 
motif,  including  hKAP-1,  hTIFla, 
hTIFly,  and  hMIDl.  The  numbers 
represent  amino  acid  positions. 
RING,  RING  finger;  B1  and  B2,  B- 
box;  TSS,  TIF1  signature  sequence 
(Venturini  et  al,  1999);  HP1BD, 
HP1  binding  domain  (Ryan  et  al, 
1999);  PHD,  plant  homeodomain; 
Bromo,  bromo  domain;  NRID, 
nuclear  receptor  interaction 
domain;  CTD,  carboxy-terminal 
domain.  The  column  to  the  right 
summarizes  known  protein-protein 
interactions:  NHR,  nuclear  hor¬ 
mone  receptor;  HP1,  heterochroma¬ 
tin  protein-1,  (b)  Amino  acid 
alignment  of  the  RING  finger,  B 
boxes,  and  coiled-coil  region  of  the 
hKAP-1,  hTIFla,  hTIFly,  and 
hMIDl  proteins.  The  alignments 
were  generated  using  the  CLUS- 
TAL  program.  The  amino  acid  resi¬ 
dues  that  are  identical  and/or 
conserved  in  all  four  proteins  are 
in  bold.  The  asterisks  below  indi¬ 
cate  the  amino  acids  targeted  for 
mutational  analysis  in  this  study. 
The  periods  represent  spaces  intro¬ 
duced  to  obtain  maximal  align¬ 
ment.  The  spacing  between  the  62- 
box  and  the  first  coil  of  KAP-1  is 
15  amino  acid  residues.  Within  the 


coiled-coil  domain,  hydrophobic  amino  acid  residues  defining  the  heptad  repeats  of  each  putative  coil  are  highlighted 
in  black.  The  relative  position  of  the  first  and  the  second  predicted  coils  of  the  coiled-coil  region  are  indicated  by  the 
gray  bars  above  the  sequence  alignment.  The  numbers  at  the  left  refer  to  the  amino  acid  positions  in  the  correspond¬ 
ing  proteins.  Database  accession  numbers:  hKAP-1  (998813);  hTIFla  (AF119042);  hTIFly  (AF1 19043);  hMIDl  (Y13667). 
(c)  A  representation  of  the  KOX1-ZFP  and  recombinant  derivatives  used  in  this  study.  The  amino  acid  sequence  of 
the  minimal  KRAB  domain  of  KOX1  is  illustrated.  The  position  of  the  DV18,19AA  mutation,  which  disrupts  KRAB- 
mediated  repression  and  association  with  KAP-1,  is  indicated  (Margolin  et  al,  1994;  Friedman  et  al,  1996).  The  KRAB 
domain  of  KOX1,  residues  1-90,  was  expressed  without  a  DNA-binding  domain  and  as  a  heterologous  fusion  protein 
with  the  DNA-binding  domain  of  GAL4,  residues  1-147.  (d)  A  diagram  of  the  KAP-l-RBCC  domain  (wild-type  and 
mutants)  and  organization  of  heterologous  KAP-l-RBCC  fusion  proteins.  The  KAP-l-RBCC  (CC65,68AA)  is  a  double 
substitution  in  the  RING  finger.  The  KAP-l-RBCC  (CCC152,153,156AAA)  is  a  triple  substitution  in  the  B1  box.  The 
KAP-l-RBCC  (C209A,H212A)  is  a  double  substitution  in  the  B2  box.  The  KAP-l-RBCC  (V260N),  KAP-l-RBCC 
(V260D),  and  KAP-l-RBCC  (L270P)  are  single  substitutions  in  the  first  predicted  coil  of  the  coiled-coil  region.  The 
KAP-l-RBCC  (L306P)  is  a  single  substitution  in  the  second  predicted  coil  of  the  coiled-coil  region.  The  TIFla-RBCC 
domain  consists  of  amino  acid  residues  1-414  of  the  TIFla  protein.  The  MID1-RBCC  domain  consists  of  residues  1- 
366  of  the  MIDI  protein.  The  RBKAP-CCMID  is  a  fusion  between  the  RING  finger,  B  boxes  of  KAP-1  (residues  22-254) 
and  the  coiled-coil  region  of  MIDI  (residues  220-366).  The  RKAP-BCCMID  is  a  fusion  between  the  RING  finger  of  KAP- 
1  (residues  22-138)  and  the  B  boxes,  and  coiled-coil  region  of  MIDI  (residues  72-366).  The  Rmid^CCj^p  is  a  fusion 
between  the  RING  finger  of  MIDI  (residues  1-71)  and  the  B  boxes,  and  coiled-coil  region  of  KAP-1  (residues  139- 
418).  The  KAP-RB- [HSFTri],  KAP-RB-[GCN4Dim],  and  KAP-RB- [GCN4Tet]  are  heterologous  fusions  between  residues 
22-254  of  KAP-1  and  the  trimerization  domain  of  HSF  (residues  321-441),  the  leucine  zipper  of  GCN4  (residues  253- 
281),  and  the  synthetic  tetramerization  domain  of  GCN4  (residues  250-281).  A  summary  of  the  KRAB  binding  poten¬ 
tial  is  indicated  by  plus  signs  at  the  right. 


KRAB  domain  binding  (Figure  3).  Our  initial  muta¬ 
tional  strategy  was  to  target  the  critical  metal  che¬ 
lation  residues.  For  the  RING  finger  and  B-boxes, 
we  mutated  the  potential  Zn  chelation  residues  in 


pairs  (or  multiples)  to  lessen  the  possibility  that 
alternative  chelation  residues  from  adjacent 
sequence  could  be  recruited  to  restore  metal  bind¬ 
ing.  A  double  substitution  (CC65,68AA)  was  made 
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Figure  2.  Purification,  and  func¬ 
tional  analysis  of  recombinant  pro¬ 
teins  expressed  in  E.  coli.  (a)  A 
Coomassie  blue-stained  SDS /poly¬ 
acrylamide  gel  of  the  purified 
recombinant  KOX1-KRAB,  KAP-l- 
RBCC,  and  GAL4-KRAB  fusion 
proteins.  From  3  to  5  fig  of  the  pur¬ 
ified  KOXl-KRAB,  KAP-l-RBCC, 
and  TIFla-RBCC  proteins  was 
loaded  per  a  lane.  1  jig  of  GAL4- 
KRAB  (wild-type  and  DV-AA 
mutant)  protein  was  loaded  per 
lane.  The  arrow  indicates  the 
recombinant  KOX1-KRAB  protein. 
The  arrowhead  indicates  the  var¬ 
ious  KAP-l-RBCC  and  TIFla-RBCC 
proteins,  (b)  Binding  of  purified 
bacterial  KAP-l-RBCC  to  a  wild- 
type  KRAB  domain  but  not  a 
mutant  form  as  detected  by  EMSA. 
EMSA  was  performed  as  described 
in  Materials  and  Methods.  Each 
reaction  contained  a  constant 
amount  of  purified  GAL4-KRAB 
(100  ng),  or  GAL4-KRAB  (DV-AA) 
(100  ng)  (a),  a  32P-labeled  DNA 
probe  containing  the  canonical 
GAL4  binding  site,  and  increasing 
amounts  of  purified  bacterial  KAP- 
l-RBCC  (a).  The  DNA:protein  com¬ 
plexes  were  resolved  in  non-dena¬ 
turing  polyacrylamide  gels.  The 
open  arrow  indicates  the  DNA:- 
GAL4-KRAB  binary  complex.  The 
slower  migrating  DNA:GAL4- 
KRAB:KAP-1-RBCC  complex  is 
represented  by  the  bracket.  FP,  free 
probe. 


at  two  conserved  cysteine  residues  in  the  consen¬ 
sus  C3HC4  signature  (Figure  1(b))  that  have  been 
shown  to  be  critical  for  both  coordination  of  zinc 
and  for  stabilization  of  the  hydrophobic  core 


(Saurin  et  ah,  1996).  Mutations  in  the  RING  finger 
completely  abolished  the  ability  of  the  KAP-l- 
RBCC  domain  to  interact  with  the  KRAB  domain 
(Figures  1(d)  and  3(a)  and  (c)). 
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The  second  structural  element  in  the  RBCC  tri¬ 
partite  motif  is  the  B  box,  which  is  characterized 
by  a  signature  spacing  of  cysteine  and  histidine 
residues  (Borden,  1998)  (Figure  1(b)).  Surprisingly, 
a  triple  substitution  (CCC  152,153,156  AAA)  in  the 
NH2-terminal  B  box  had  no  affect  on  the  KAP-1  - 
RBCC  binding  to  the  KRAB  domain  (Figures  1(d) 
and  3(c)).  Flowever,  a  double  substitution 
(C209A,H212A)  in  the  second  B  box  (Figure  1(b)) 
strongly  reduced  the  KAP-l-RBCC  and  KRAB 
domain  interaction  (Figures  1(d)  and  3(c)). 

The  coiled-coil  domain  of  KAP-1  is  predicted  to 
fold  into  two  potential  leucine  zipper-like  motifs 
(Wolf  et  al,  1997)  (Figures  1(b)  and  3(b)).  Each 
region  is  likely  to  be  an  amphipathic  a-helix,  hav¬ 
ing  a  seven-residue  repeat  (a.b.c.d.e.f.g)„,  with 
hydrophobic  residues  at  positions  a  and  d  and 
polar  residues  at  the  other  positions  (Figure  3(b)). 
We  altered  the  leucine  zipper  motif  by  substituting 
a  proline  residue  for  a  hydrophobic  amino  acid, 
which  would  be  predicted  to  disrupt  the  a-helix 
structure.  The  L270P  mutation  at  position  d  in  coil 
I  shows  an  intermediate  effect  on  the  KRAB:KAP-1 
interaction  (Figures  1(d)  and  3(a)  and  (c)).  The 
L306P  mutation  at  position  d  in  coil  II  was  found 
to  completely  disrupt  the  KRAB:KAP-1  interaction 
(Figures  1(d)  and  3(c)).  We  also  tested  whether  the 
substitution  of  a  polar  amino  acid  for  a  hydro- 
phobic  amino  acid  at  position  a  would  interfere 
with  the  KRAB:KAP-1  interaction  (Figure  3(b)). 
However,  change  of  Val260  to  either  Asn  or  Asp 
displayed  little  effect  on  the  KRAB:KAP-1-RBCC 
interaction  (Figures  1(d)  and  3(c)).  Together,  the 
above  data  suggest  that  an  intact  RING  finger,  B 
boxes,  and  the  coiled-coil  domain  are  each  import¬ 
ant  for  the  stable  binding  of  the  KAP-l-RBCC  to  a 
KRAB  domain. 

Since  TIFla  is  40  %  identical  with  KAP-1  within 
the  RBCC  domain  (Figure  1(b)),  we  investigated 
whether  the  RBCC  domain  of  TIFla  could  interact 
with  the  KRAB  domain.  We  expressed  and  puri¬ 
fied  the  RBCC  domain  of  TIFla  (residues  1-414) 
(Figure  2(a)).  The  RBCC  domain  of  TIFla  dis¬ 
played  no  binding  to  the  KRAB  domain 
(Figure  3(c)),  an  observation  that  is  in  contrast  with 
a  previous  report  (Moosmann  et  al ,  1996).  It  is 
possible  that  the  KRAB-TIFla  interaction  reported 
by  Moosmann  was  indirect,  as  it  was  detected  only 
in  yeast  two-hybrid  assays  and  was  independent 
of  the  RING  finger  and  B  boxes. 

KAP-l-RBCC  exists  in  an  oligomeric  state 

To  investigate  the  mechanisms  behind  the 
KRAB:KAP-1  interaction,  we  employed  biochemi¬ 
cal  and  biophysical  analyses  of  the  KRAB  domain, 
the  KAP-l-RBCC  domain,  and  the  complex.  Gel- 
filtration  chromatography  was  used  to  estimate  the 
native  hydrodynamic  size  (Stokes  radius)  of  the 
KAP-l-RBCC  protein,  which  eluted  with  an  appar¬ 
ent  molecular  mass  of  158  kDa  (Figure  4(a)-I).  This 
value  is  roughly  three  times  the  molecular  mass  of 
the  KAP-l-RBCC  monomer,  suggesting  that  the 


KAP-l-RBCC  domain  is  an  apparent  trimer.  This 
result  is  consistent  with  multi-coil  scoring  analysis 
(Wolf  et  al,  1997)  for  the  KAP-1  coiled-coil  region. 

To  determine  the  oligomeric  state  of  the  KAP-l- 
RBCC  in  the  presence  of  the  native  KRAB  domain, 
a  KRAB:KAP-1-RBCC  complex  was  pre-formed  in 
solution,  and  then  subjected  to  gel  filtration.  This 
experiment  required  purified  KOX1-KRAB  protein 
from  E.  coli,  which  initially  remained  profoundly 
insoluble.  We  also  attempted  to  express  the  KRAB 
domains  from  ZNF133,  ZNF140,  ZNF141,  KRK-1, 
and  EEK-1,  which  also  proved  insoluble.  We 
finally  succeeded  in  producing  soluble  KOX1- 
KRAB  protein  by  employing  a  protocol  that  refolds 
the  protein  while  bound  to  the  Ni-NTA  resin  (Shi 
et  al,  1997).  The  KOX1-KRAB  domain  was  purified 
to  greater  than  98  %  purity  and  migrated  on  SDS- 
PAGE  with  an  apparent  molecular  mass  of 
15.5  kDa,  which  was  slightly  larger  than  the  pre¬ 
dicted  molecular  mass  of  13.9  kDa  (Figure  2(a)). 
Full  analysis  of  this  KOX1-KRAB  protein  will  be 
described  elsewhere.  However,  secondary  structure 
analysis  using  circular  dichroism  spectroscopy 
indicated  that  the  protein  was  properly  folded,  and 
the  protein  was  highly  active  in  binding  assays.  As 
shown  in  Figure  4(a)-III,  the  KAP-l-RBCC  domain 
in  KOXl-KRAB:KAP-l-RBCC  complex  eluted  from 
the  gel-filtration  column  as  a  distribution  of 
species,  most  likely  existing  in  a  complex  equili¬ 
brium,  where  the  majority  eluted  with  an  apparent 
molecular  mass  consistent  with  a  trimer.  Note  the 
slightly  larger  mass  observed  for  the  KAP-l-RBCC 
complexed  with  the  KRAB  domain  (fraction  11, 
Figure  4(a)-III)  compared  to  uncomplexed  KAP-l- 
RBCC  (fraction  12,  Figure  4(a)-I).  However,  other 
species  exist  and  these  can  co-elute  with  the  KRAB 
domain.  The  majority  of  the  KOX1-KRAB  protein 
eluted  at  670  kDa,  suggesting  that  this  protein 
aggregates  when  it  is  not  complexed  with  KAP-l- 
RBCC  (Figure  4(a)-II).  To  confirm  that  this  complex 
was  stable  during  chromatography,  the  peak  frac¬ 
tion  (11)  containing  the  complex  was  re-chromato¬ 
graphed  under  identical  conditions.  The  peak  of 
the  KOXl-KRAB:KAP-l-RBCC  complex  eluted 
exactly  as  the  previous  run,  with  an  apparent 
molecular  mass  slightly  larger  than  158  kDa 
(Figure  4(a)-IV).  We  conclude  that  (1)  the  KOX1- 
KRAB:KAP-1-RBCC  complex  can  be  readily 
formed  in  solution  and  is  stable  to  gel  filtration 
under  these  conditions;  (2)  there  is  a  small  but  sig¬ 
nificant  increase  in  the  Stokes  radius  of  the  com¬ 
plex  compared  with  KAP-l-RBCC  alone;  and  (3) 
the  KAP-l-RBCC  domain  exists  as  an  oligomer, 
probably  a  trimer  in  the  absence  or  presence  of  the 
KRAB  domain. 

We  next  used  velocity  sedimentation  in  sucrose 
gradients  to  estimate  the  native  mass  of  the  KOX1- 
KRAB:KAP-1-RBCC  complex  and  its  components. 
The  KAP-l-RBCC  protein  alone  sedimented  with 
an  apparent  mass  of  66  kDa  (Figure  4(b)-I), 
suggesting  either  that  the  oligomeric  state  of  the 
KAP-l-RBCC  in  solution  was  not  stable  to  the  con¬ 
ditions  of  sucrose  gradient  sedimentation,  or  that 
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the  protein  was  in  an  extended  configuration  that 
results  in  the  relatively  large  Stokes  radius 
observed  by  gel  filtration.  The  KOX1-KRAB  pro¬ 
tein  alone  sedimented  with  apparent  masses  of  670 
and  13.7  kDa  (Figure  4(b)-II).  However,  pre-for¬ 
mation  of  a  KOXl-KRAB:KAP-l-RBCC  complex 
prior  to  centrifugation  resulted  in  a  dramatic  shift 
in  the  sedimentation  of  KAP-l-RBCC  domain  to  an 
apparent  mass  of  approximately  158  kDa 
(Figure  4(b)-III).  The  KOX1-KRAB  domain  was 
detected  with  apparent  molecular  masses  of  670, 
158,  and  25  kDa,  most  likely  representing  aggre¬ 
gated,  KAP-l-RBCC-complexed,  and  monomeric  or 
dimeric  uncomplexed  KOX1-KRAB  protein, 
respectively.  This  sedimentation  analysis  supports 
the  conclusion  that  the  KAP-l-RBCC  is  oligomeric 
and  that  the  KRAB  domain  may  stabilize  the  oligo¬ 
merization  of  the  KAP-l-RBCC  under  these  sucrose 
gradient  conditions. 

We  next  utilized  non-denaturing-PAGE  (ND- 
PAGE)  to  estimate  the  native  size  of  KAP-l-RBCC. 
We  calculated  the  mobilities  of  the  protein  stan¬ 
dards  and  of  KAP-l-RBCC  relative  to  the  tracking 
dye  in  a  variety  of  polyacrylamide  gel  concen¬ 
trations  and  used  this  to  generate  plots  of  mobility 
versus  polyacrylamide  concentration.  The  slopes  of 
such  plots  represent  the  "retardation  coefficients'7 
of  the  proteins  (Bryan,  1977).  A  log-log  plot  was 
generated  in  which  the  negative  retardation  coeffi¬ 
cients  of  the  standards  were  plotted  against  the 
known  native  molecular  masses  of  the  standards 
(Figure  4(c)).  From  this  calibration  curve,  we  esti¬ 
mated  that  the  KAP-l-RBCC  has  a  native  size  of 
200  kDa  under  this  condition.  These  data  further 
substantiate  the  suggestion  that  the  KAP-l-RBCC 
protein  exists  as  an  oligomer. 

To  further  investigate  the  oligomeric  state  of  the 
KAP-l-RBCC  domain,  sedimentation  equilibrium 
experiments  were  performed  using  analytical  ultra¬ 
centrifugation.  For  each  experiment,  analyses  were 
performed  at  4°C  and  25  °C,  and  the  concentration 
of  protein  versus  radius  data  was  fitted  with  var¬ 
ious  models  of  self-association,  using  non-linear 
regression  (Johnson  et  al .,  1981).  The  KAP-l-RBCC 
protein  showed  non-ideal  behavior  under  these 


conditions.  For  two  independent  experiments,  each 
containing  three  samples  of  differing  KAP-l-RBCC 
protein  concentration,  the  data  were  best  described 
by  a  model  containing  monomer,  trimer,  and  hex- 
amer  terms  (Figure  5).  However,  when  multiple 
data  sets  were  fitted  simultaneously,  neither  the 
monomer-trimer  nor  the  monomer-hexamer  associ¬ 
ation  could  be  described  by  a  single  equilibrium 
constant,  indicating  that  these  associations  are  not 
reversible  on  the  time-scale  of  the  experiment.  The 
apparent  equilibrium  constants  describing  the 
monomer-trimer  and  the  monomer-hexamer 
association  increased  with  decreasing  protein  con¬ 
centration  loaded,  which  suggests  that  this  appar¬ 
ent  irreversibility  is  due  to  the  presence  of 
incompetent  trimer  and  hexamers,  i.e.  oligomers 
that  do  not  dissociate.  There  may  also  be  a  revers¬ 
ible  association,  however,  as  there  was  some  redis¬ 
tribution  of  the  species  (most  obviously  a 
reduction  in  the  proportion  of  hexamer)  when  the 
temperature  was  raised  from  4°C  to  25  °C  in  a 
single  experiment.  Unfortunately,  we  were  unable 
to  obtain  useful  data  starting  with  a  pre-formed 
KOXl-KRAB:KAP-l-RBCC  complex,  because  the 
protein  concentrations  of  the  complex  that  could 
be  obtained  were  not  high  enough  and  a  signifi¬ 
cant  portion  of  the  sample  formed  aggregates 
under  the  conditions  required  for  the  ultracentrifu¬ 
gation  experiments.  Nonetheless,  these  analyses 
suggest  that  KAP-l-RBCC  is  predominantly  a  tri¬ 
mer  in  solution  and  a  trimer-hexamer  equilibrium 
may  play  a  physiological  role. 

We  next  employed  chemical  cross-linking  ana¬ 
lyses  with  ethylene  glycol  bis  (succinimidylsucci- 
nate)  (EGS)  to  determine  the  oligomeric  state  of  the 
KAP-l-RBCC  proteins  (wild-type  and  mutants) 
(Figure  6).  Treatment  of  purified  KAP-l-RBCC  pro¬ 
tein  with  low  concentrations  of  EGS  yielded  four 
distinct  cross-linked  species  with  apparent  molecu¬ 
lar  masses  of  100,  135,  210,  and  270-300  kDa, 
potentially  corresponding  to  dimeric,  trimeric,  tet- 
rameric,  and  hexameric  forms  of  the  KAP-l-RBCC 
domain  (Figure  6(a)).  The  monomeric  form  of  the 
KAP-l-RBCC  domain  migrated  at  49  kDa  in  SDS- 
PAGE.  Treatment  of  a  pre-formed  KOX1-KRAB:- 


Figure  3.  Mutations  within  the  RING  finger,  B  boxes,  and  the  coiled-coil  region  of  the  KAP-l-RBCC  abolish  inter¬ 
action  with  the  KRAB  domain  in  vitro .  (a)  Binding  of  the  wild-type  KAP-l-RBCC  or  the  mutant  KAP-l-RBCC 
(CC65,68 A  A)  /  (L270P)  proteins  to  the  wild-type  GAL4-KRAB,  as  detected  by  EMSA.  Wild-type  KAP-l-RBCC  (1  pg) 
or  increasing  amounts  of  mutant  protein  (63,  125,  250,  500,  1000  ng)  (Figure  2(a))  were  incubated  with  the  wild-type 
GAL4-KRAB  protein.  The  open  arrow  indicates  DNA:GAL4-KRAB  mobility  shift.  The  bracket  indicates  the  DNA:- 
GAL4-KRAB:KAP-1-RBCC  ternary  complex,  (b)  An  illustration  of  the  putative  heptad  repeat  for  coil  I  (residues  253- 
294)  and  coil  II  (residues  299-322)  of  the  KAP-1  coiled-coil  region.  Note  the  potential  amphipathic  nature  of  the  puta¬ 
tive  helices.  The  heptad  positions  are  labeled  by  letters  a  through  g  in  the  helix,  where  hydrophobic  residues  at  pos¬ 
itions  a  and  d  might  constitute  the  cores  of  homo-trimeric  oligomerization  domain.  The  hydrophilic  residues  at  the 
other  positions  are  indicated  with  the  black  background.  The  two  leucine  residues  (L270  and  L306)  mutated  to  pro¬ 
line,  and  the  valine  residue  (V260)  mutated  to  asparagine  or  aspartic  acid  are  boxed,  (c)  The  interaction  between  the 
KRAB  domain  and  the  KAP-l-RBCC  domain  is  dependent  upon  the  integrity  of  the  RBCC  domain.  The  binding  of 
GAL4-KRAB  to  KAP-l-RBCC  or  various  mutant  KAP-l-RBCC  proteins,  as  detected  by  EMSA.  A  constant  amount  of 
GAL4-KRAB  (100  ng),  1  pg  of  each  KAP-l-RBCC  protein  (wild-type  or  mutant),  and  1  pg  of  TIFla-RBCC  protein  was 
added  to  each  reaction.  The  arrow  indicates  the  mobility  shift  for  the  GAL4-KRAB  protein.  The  bracket  indicates  the 
DNA:GAL4-KRAB:KAP-1-RBCC  ternary  complex.  FP,  free  probe. 
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KAP-l-RBCC  complex  with  low  concentration  of 
EGS  yielded  the  same  set  of  cross-linked  species  as 
was  observed  for  the  KAP-l-RBCC  domain.  How¬ 
ever,  a  cross-linked  species  consistent  with  the  for¬ 


mation  of  a  trimeric  KAP-l-RBCC  domain 
appeared  as  the  predominant  species  even  at  very 
low  levels  of  EGS  (Figure  6(b)).  These  data  suggest 
that  the  KRAB  domain  preferentially  binds  to  a 
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Figure  4.  Analyses  of  the  KAP-l-RBCC  motif,  the  KOX1-KRAB  domain,  and  a  KOXl-KRAB:KAP-l-RBCC  complex 
by  gel-filtration,  sucrose  gradient  sedimentation,  and  ND-PAGE.  (a)  Gel-filtration  analysis  of  KAP-l-RBCC  protein, 
KOX1-KRAB  protein,  and  a  pre-formed  KOXl-KRAB:KAP-l-RBCC  complex.  The  proteins  were  resolved  on  a  Super- 
dex  200  column.  The  fractions  were  collected  and  proteins  were  analyzed  by  SDS-PAGE.  The  fraction  numbers  are 
labeled  at  the  top  of  the  Figure.  The  migration  of  protein  standards  is  indicated  above  the  gels.  The  protein  standards 
were  run  in  a  parallel  gel-filtration  experiment  under  identical  conditions.  (I)  Elution  profile  of  native  KAP-l-RBCC 
protein;  (II)  the  elution  profile  of  KOX1-KRAB  protein;  (III)  the  elution  profile  for  a  pre-formed  KOXl-KRAB:KAP-l- 
RBCC  complex;  (IV)  the  profile  of  the  material  from  fraction  11  (III)  re-chromatographed  on  the  Gel-filtration  column, 
(b)  Sucrose  gradient  sedimentation  analysis  of  the  KAP-l-RBCC  domain,  KOX1-KRAB  domain,  and  the  KOX1- 
KRAB:KAP-1-RBCC  complex.  The  proteins  from  sucrose  gradient  fractions  were  analyzed  by  SDS-PAGE.  The  fraction 
numbers  are  indicated  at  the  top  of  each  lane.  The  peak  position  of  protein  standards  obtained  from  gradients  run  in 
parallel  is  indicated  above  the  gels,  (c)  Determination  of  the  molecular  mass  of  native  KAP-l-RBCC  by  ND-PAGE. 
The  calibration  curve  prepared  from  Ferguson  plots  is  shown  (see  Materials  and  Methods  for  details).  The  known 
masses  of  standards  and  the  calculated  mass  of  native  KAP-l-RBCC  are  given  (in  kDa)  above  the  curve. 


discrete  KAP-l-RBCC  oligomer  and  stabilizes  this 
oligomeric  form,  which  is  most  consistent  with  a 
trimer.  These  cross-linking  results  are  consistent 
with  the  sedimentation  observed  for  a  pre-formed 
KOXl-KRAB:KAP-l-RBCC  complex  in  sucrose  gra¬ 
dients  (Figure  4(b)-III). 

Analysis  of  the  mutant  KAP-l-RBCC  proteins 
(CC65,68AA)  and  (L270P),  which  were  severely 
impaired  in  KRAB  binding  (Figures  1(d)  and  3(a) 
and  (c)),  showed  little  to  no  cross-linked  species 
(dimer,  trimer,  or  hexamer)  under  the  same  cross- 
linking  conditions  (Figure  6(c)  and  (d)).  Further¬ 
more,  these  mutant  proteins  migrated  with  an 
apparent  molecular  mass  of  approximately  66  kDa 
in  sucrose  gradients  and  this  migration  was  not 
altered  by  pre-incubation  with  the  KOX1-KRAB 
protein  (data  not  shown).  These  data  suggest  that 
the  formation  of  the  KOXl-KRAB:KAP-l-RBCC 
complex  likely  requires  the  proper  oligomeric  state 
of  the  KAP-l-RBCC,  which  is  apparently  a  trimer, 
and  that  KRAB  binding  stabilizes  this  state  in 
solution. 

The  studies  described  above  were  exclusively 
performed  with  bacterial-expressed  proteins  that 
lack  any  eukaryotic  post-translational  modifi¬ 
cations  and  the  potential  to  interact  with  endogen¬ 
ous  cellular  partner  proteins.  To  determine  if  our 
observations  extend  to  eukaryotic  cell-expressed 


proteins,  we  overexpressed  the  KAP-l-RBCC  and 
its  mutant  derivatives  in  Sf9  insect  cells  using  a 
baculovirus  expression  vector.  Full-length  wild- 
type  KAP-1,  KAP-l-RBCC  and  mutants  thereof 
were  highly  expressed  as  soluble  proteins  in  Sf9 
cells.  The  bv.KAP-l-RBCC  was  purified  under 
native  conditions  and  migrated  as  a  single  poly¬ 
peptide  band  consistent  with  its  predicted  mono¬ 
meric  molecular  mass  of  47.7  kDa  (data  not 
shown).  This  behavior  suggests  that  bv.KAP-l- 
RBCC  is  not  significantly  subjected  to  extensive 
post-translational  modification,  as  detected  by 
SDS-PAGE  when  expressed  in  eukaryotic  cells.  The 
bv.KAP-l-RBCC  protein  behaved  identically  with 
the  E.  co/z-expressed  protein  in  all  of  the  biochemi¬ 
cal  binding  assays  and  was  equally  active  in  KRAB 
domain  interaction  (data  not  shown). 

To  estimate  the  native  mass  and  oligomeric 
states  of  KAP-1  proteins  (wild-type  and  mutants) 
expressed  in  the  baculovirus  expression  system, 
lysates  from  infected  cells  that  contain  the 
expressed  proteins  were  subjected  to  gel-filtration 
chromatography  and  detected  in  a  Western  blot 
assay  (Figure  7(a)).  The  full-length  KAP-1  protein 
eluted  with  an  apparent  molecular  mass  of  500  to 
670  kDa  (Figure  7(a)-I).  This  elution  profile  was 
identical  with  both  bacterial-expressed  full-length 
KAP-1  and  endogeneous  KAP-1  derived  from 
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Figure  5.  Representation  of  analytical  ultracentrifuga¬ 
tion  analysis  of  KAP-l-RBCC.  Sedimentation  equili¬ 
brium  analysis  of  KAP-l-RBCC  was  performed  with 
three  different  loading  concentrations  in  separate  cells  at 
13,000  rpm  and  4°C.  The  circles  in  the  main  panel  (d) 
show  the  concentration  versus  radius  data  for  the  three 
cells  at  equilibrium.  The  three  data  sets  were  fitted 
separately  with  a  model  describing  monomers,  trimers, 
and  hexamers,  using  the  non-linear  regression  program 
NONLIN  (Johnson  et  ah,  1981).  The  calculated  fit  is 
represented  by  the  continuous  lines,  (a)-(c)  show  the 
residuals  for  the  data  points  of  the  fitted  curves  at  the 
three  protein  concentrations  displayed  from  highest  con¬ 
centration  (top)  to  lowest  concentration. 


mammalian  cell  nuclear  extracts  (unpublished 
results).  Since  the  apparent  monomeric  molecular 
mass  measured  by  SDS-PAGE  of  full-length  KAP-1 
is  97  kDa,  this  result  suggests  that  native  full- 
length  KAP-1  was  an  oligomer,  possibly  as  large  as 
a  hexamer.  The  bv.KAP-l-RBCC  protein  eluted 
with  an  identical  Stokes  radius  as  the  E.  coli  KAP- 
l-RBCC  protein  at  approximately  158  kDa 
(Figure  7(a)-II),  suggesting  that  E.  coli  and 
bv. -expressed  KAP-l-RBCC  share  a  common  oligo¬ 
meric  state.  Mutations  in  the  RING  finger 
(CC65,68AA)  and  the  coiled-coil  region  (L270P)  of 
the  KAP-l-RBCC  appeared  to  alter  oligomerization 
properties,  as  these  mutant  proteins  eluted  with  an 
apparent  molecular  mass  of  100  kDa  (Figure  7(a)- 


111,1V),  and  also  failed  to  bind  GAL4-KRAB  in  the 
EMSA  assay  (Figure  8(a),  lanes  3  and  4).  These 
data  are  consistent  with  those  observed  for  pro¬ 
teins  produced  in  E.  coli ,  and  strengthen  the  argu¬ 
ment  that  the  KAP-l-RBCC  must  properly 
oligomerize  for  KRAB  binding,  which  is  indepen¬ 
dent  of  post-translational  modifications. 

Formation  and  detection  of  KAP-1  :KAP-1- 
RBCC  hetero-oligomers  in  vivo 

To  reconstitute  hetero-oligomers  of  the  KAP-l- 
RBCC  domain  with  full-length  KAP-1  in  vivo,  we 
co-expressed  full-length  KAP-1  with  either  wild- 
type  or  mutant  KAP-l-RBCC  proteins  via  co-infec¬ 
tion  of  Sf9  cell  cultures  (Figure  7(b)).  Because  the 
KAP-l-RBCC  encodes  an  NH2-terminal  6His  tag, 
its  association  with  wild-type  full-length  KAP-1 
could  be  detected  by  Ni-NTA  chromatography,  fol¬ 
lowed  by  Western  blot  analysis  using  an  anti-KAP- 
1  serum  that  detects  amino  acid  residues  423-589,  a 
region  that  is  not  present  in  the  KAP-l-RBCC  pro¬ 
tein  (Friedman  et  al,  1996)  (Figure  7(b)).  The  KAP- 
l-RBCC  bound  to  the  Ni-NTA  resin  was  detected 
with  an  anti-His  monoclonal  antibody.  The  full- 
length  bv.KAP-1  (which  has  no  6His  tag)  did  not 
bind  to  Ni-NTA  resin  (Figure  7(b),  lane  1).  When 
the  Ni-NTA  resin  was  incubated  with  Sf9  cell 
lysates,  in  which  full-length  KAP-1  protein  was  co¬ 
expressed  with  the  KAP-l-RBCC,  we  observed 
retention  of  the  full-length  KAP-1  protein 
(Figure  7(b),  lane  2).  These  results  suggest  that  the 
KAP-l-RBCC  domain  can  hetero-oligomerize  with 
the  hill-length  KAP-1  protein.  Interestingly,  simple 
mixing  of  cell  lysates  that  contained  independently 
expressed  proteins  failed  to  recapitulate  this  inter¬ 
action  (data  not  shown).  Thus,  the  formation  of 
hetero-oligomers  between  full-length  KAP-1  and 
the  KAP-l-RBCC  domain  may  be  coupled  to  trans¬ 
lation  or  other  post-translation  processes,  an  obser¬ 
vation  that  has  been  made  for  other  oligomeric 
coiled-coil  proteins.  The  KAP-l-RBCC  mutants 
CC65,68AA  and  L270P  also  efficiently  associated 
with  full-length  KAP-1  in  vivo  using  the  co-infec- 
tion  assay  (Figure  7(b),  lanes  3  and  4).  This  result 
suggests  that  these  mutations  inhibit  the  homo¬ 
oligomerization  of  the  RBCC  domain  but  that  these 
RBCC  mutants  can  still  hetero-oligomerize  with  a 
wild-type  RBCC  domain  in  vivo. 

Since  the  CC65,68AA  and  L270P  mutant  RBCC 
proteins  could  apparently  hetero-oligomerize  with 
full-length  KAP-1,  based  upon  the  data  above,  we 
tested  the  ability  of  these  hetero-oligomers  to  inter¬ 
act  with  the  KRAB  domain  in  the  EMSA  assay. 
The  bv.KAP-l-RBCC  formed  the  characteristic 
complex  with  the  DNA-bound  GAL4-KRAB  pro¬ 
tein  (Figure  8(a),  lane  2).  The  mutant  proteins 
CC65,68AA  and  L270P  each  failed  to  form  this 
ternary  complex  (Figure  8(a),  lanes  3  and  4),  a 
result  that  is  consistent  with  our  previous  findings, 
in  which  E.  co/z-derived  mutant  RBCC  proteins 
failed  to  bind  the  KRAB  domain  (Figures  1(d)  and 
3(a)  and  (c)).  The  full-length  bv.KAP-1  protein 
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Figure  6.  Chemical  cross-linking  analysis  of  the  KAP-l-RBCC  domain.  Equal  amounts  (5  jiM)  of  (a)  wild-type 
KAP-l-RBCC,  (b)  a  KOX1-KRAB: KAP-1 -RBCC  complex,  (c)  the  (CC65,68AA)  mutant,  and  (d)  the  (L270P)  mutant 
were  incubated  with  various  concentrations  of  EGS.  The  cross-linked  KAP-l-RBCC  products  were  resolved  by  SDS- 
PAGE  and  visualized  by  Western  blot  analysis.  An  anti-His  monoclonal  antibody  (Qiagen)  was  used  to  detect  the 
KAP-l-RBCC  proteins  in  panels  (a),  (c)  and  (d).  An  anti-KAP-l-RBCC  rabbit  IgG  was  used  to  detect  the  KAP-l-RBCC 
protein  in  (b).  The  position  of  molecular  markers  is  indicated  on  the  left  of  each  Western  blot.  The  estimated  multi¬ 
meric  forms  of  the  proteins  are  indicated  on  the  right  of  each  Western  blot. 
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Figure  7.  Expression  and  func¬ 
tional  analysis  of  KAP-1  proteins 
expressed  from  baculovirus  vectors, 
(a)  Gel-filtration  analysis  of  full- 
length  KAP-1  (I),  wild-type  KAP-1 - 
RBCC  (II),  and  mutant  KAP-1- 
RBCC  (III,  IV).  The  elution  of  each 
protein  from  a  Superdex  200  col¬ 
umn  was  detected  by  Western  blot 
analysis.  The  fraction  numbers  are 
labeled  at  the  top  of  each  lane,  and 
the  protein  standards  are  indicated 
above  the  blots.  An  affinity-purified 
rabbit  polyclonal  anti-KAP-1  serum 
was  used  to  detect  full-length  KAP- 
1.  An  anti-His  monoclonal  antibody 
was  used  to  detect  the  KAP-1- 
RBCC  proteins,  (b)  Formation  and 
detection  of  KAP-1:  KAP-1 -RBCC 
hetero-oligomers  in  vivo.  Diagrams 
of  full-length  KAP-1  and  6His 
KAP-1 -RBCC  proteins  expressed  in 
baculovirus  are  presented.  The  pro¬ 
teins  were  either  expressed  indivi¬ 
dually,  or  co-expressed  in  Sf9  cells. 
Hetero-oligomerization  of  the  KAP- 
1-RBCC  domain  with  full-length 
KAP-1  was  detected  by  binding  the 
complex  to  Ni-NTA  resin.  The 
bound  proteins  were  detected  by 
Western  blot  analysis.  The  bound 
full-length  KAP-1  protein  was 
detected  using  a  affinity  purified 
rabbit  polyclonal  anti-KAP-1 
serum,  and  the  KAP-l-RBCC  pro¬ 
teins  were  detected  with  the  anti- 
His  monoclonal  antibody.  The  - 
4-  represents  the  viruses  used  to 
infect  Sf9  cells. 


yielded  a  supershift  that  was  easily  distinguishable 
from  the  supershift  produced  by  bv.KAP-l-RBCC 
(Figure  8(a),  lane  5).  It  is  well  established  that  this 
difference  can  be  exploited  as  an  assay  for  hetero¬ 
oligomerization  of  proteins  by  searching  for  inter¬ 
mediate  mobility  supershifts  (Hope  &  Struhl, 
1987).  The  bv.KAP-1  and  bv.KAP-l-RBCC  co¬ 
infected  extracts  produced  a  spectrum  of  five  inter¬ 
mediate  supershifts  (Figure  8(a),  lane  6),  including 
the  bands  that  co-migrate  with  the  authentic 
bv.KAP-1  and  bv.KAP-l-RBCC  supershifts.  This 
suggests  that  discrete  hetero-oligomers  composed 
of  various  combinations  of  full-length  KAP-1  and 
wild-type  RBCC  domain  can  bind  to  the  KRAB 
domain.  Each  of  these  supershifts  contains  either 
KAP-1,  KAP-l-RBCC,  or  a  combination  thereof  as 
determined  by  antibody  supershift  experiments 
(data  not  shown).  If  the  complex  between  bv.KAP- 
1  and  bv.KAP-l-RBCC  was  a  simple  hetero-dimer, 
a  single  intermediate  species  would  be  predicted. 
The  presence  of  multiple  discrete  species  strongly 


suggests  that  hetero-oligomers  are  formed,  most 
likely  hetero-trimers.  Co-expression  of  KAP-1  with 
either  mutant  RBCC  protein  yielded  mobility  shifts 
that  were  consistent  only  with  the  supershift 
observed  for  full-length  KAP-1  (Figure  8(a),  lanes  7 
and  8).  A  plausible  interpretation  of  these  results  is 
illustrated  by  Figure  8(b).  Bands  A  and  B  represent 
hexamer  and  trimer  forms  of  full-length  KAP-1, 
respectively,  which  are  in  equilibrium;  the  hexamer 
is  formed  most  efficiently  by  the  full-length  KAP-1 
protein  and  binds  with  highest  affinity  to  the 
KRAB  domain.  Bands  C  and  D  are  hetero-trimers 
that  contain  one  and  two  molecules  of  KAP-l- 
RBCC  per  complex,  respectively.  These  complexes 
can  bind  the  KRAB  domain  but  may  lack  the  abil¬ 
ity  to  form  a  hexamer  under  these  conditions. 
Band  E  is  the  homo-trimer  of  wild-type  KAP-l- 
RBCC.  The  mutant  RBCC  proteins  retain  the  ability 
to  hetero-trimerize,  but  any  complex  containing  a 
single  mutant  molecule  would  be  inactive  for 
KRAB  domain  binding,  as  evidenced  by  the  loss  of 
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Figure  8.  Binding  of  KAP-l-RBCC  :KAP-1  hetero-oligomers  to  a  DNA-bound  KRAB  domain  in  vitro,  (a)  The  binding 
of  KAP-1-RBCC:KAP-1  hetero-oligomers  to  GAL4-KRAB,  as  detected  by  EMSA.  Each  reaction  contained  a  constant 
amount  of  purified  GAL4-KRAB  (100  ng),  5  gg  of  baculovirus  whole-cell  lysates,  in  which  full-length  KAP-1  protein 
was  co-expressed  with  either  the  wild-type  or  mutant  KAP-1 -RBCC  protein,  and  1  pi  of  32P-labeled  GAL4  probe.  The 
symbols  on  the  left  side  of  the  panel  define  the  following  complexes:  dash,  DNA:GAL4-KRAB  alone;  filled  arrow, 
DNA:GAL4-KRAB:KAP-1-RBCC  complex;  open  arrowhead,  DNA:GAL4-KRAB:KAP-1:RBCC  complex;  bracket,  DNA:- 
GAL4-KRAB:KAP-1  complex.  The  free  probe  was  allowed  to  run  off  the  bottom  of  the  gel  for  better  resolution  of  the 
mobility  shifts  at  the  top  of  the  gel.  Note  that  no  binding  of  the  (CC65,68AA),  or  the  (L270P)  mutant  KAP-1 -RBCC  pro¬ 
teins  to  GAL4-KRAB  was  detected  (lanes  3  and  4).  Also,  hetero-oligomers  of  full-length  KAP-1  with  either  of  the 
mutant  KAP-1 -RBCC  proteins  failed  to  yield  any  intermediate  supershifts  (lanes  7  and  8)  of  the  GAL4-KRAB  complex, 
which  were  observed  for  hetero-oligomers  of  full-length  with  the  wild- type  KAP-1 -RBCC  domain  (lane  6).  (b)  An 
interpretation  for  the  formation  of  GAL4-KRAB:KAP-1:KAP-1-RBCC  hetero-oligomers  shown  in  the  EMSA  assay  (a). 


complexes  C,  D,  and  E  in  Figure  8(a),  lanes  7  and 
8.  These  results  suggest  that  oligomerization  alone 
is  not  sufficient  for  interaction  of  KAP-1  with  the 
KRAB  domain.  Moreover,  individual  components 
within  the  RING  finger,  B  boxes,  and  coiled-coil 
region  independently  assist  the  proper  oligomeri¬ 
zation  of  KAP-1  and  its  interaction  with  the  KRAB 
domain. 

Heterologous  oligomerization  domains  fail  to 
reconstitute  the  KOX1 -KRAB: KAP-1  interaction 

The  data  presented  above  suggest  a  simple 
model  for  the  KAP-l-RBCC  domain:  that  the 
coiled-coil  region  is  a  parallel  trimeric  association 
that  serves  to  properly  orient  the  RING  finger  and 


B  boxes  for  binding  to  the  KRAB  domain.  If  this 
hypothesis  is  correct,  we  may  be  able  to  substitute 
a  similarly  structured  coiled-coil  domain  to  recon¬ 
stitute  KRAB  binding.  We  thus  substituted  the 
coiled-coil  region  of  KAP-1  with  well-characterized 
heterologous  oligomerization  domains  for  trimeri- 
zation  (HSF)  (Drees  et  al,  1997),  dimerization 
(GCN4-LZ),  and  tetramerization  (GCN4-TZ) 
(Waterman  et  al.,  1996)  (Figure  1(d)).  These  heter¬ 
ologous  KAP-l-RBCC  fusion  proteins  were 
expressed  as  soluble  proteins  and  were  properly 
oligomerized,  as  determined  by  chemical  cross- 
linking  and  gel  filtration  (data  not  shown).  As 
illustrated  in  Figures  1(d)  and  9,  each  of  these  pro¬ 
teins,  KAP-RB-[HSFTri],  KAP-RB- [GCN4Dim],  and 
KAP-RB-[GCN4Xet],  failed  to  interact  with  the 
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Figure  9.  Substitution  of  the  RING  finger,  B-boxes, 
and  the  coiled-coil  sub-domains  of  KAP-1  disrupts  the 
KOXl-KRAB:KAP-l  interaction:  1  pg  of  each  wild-type 
KAP-l-RBCC  or  heterologous  KAP-l-RBCC  fusion  pro¬ 
tein  was  tested  for  its  ability  to  interact  with  the  KRAB 
domain  in  vitro ,  via  the  EMSA.  The  wild-type  KAP-l- 
RBCC  protein  efficiently  supershifted  the  DNA-bound 
GAL4-KRAB  protein  to  form  a  complex  indicated  by  the 
bracket.  None  of  the  heterologous  RBCC  fusion  proteins 
demonstrated  this  activity,  although  the  RMIDBCCKAP 
fusion  protein  showed  a  little  of  this  activity.  The  filled 
arrow  indicates  the  DNA:GAL4-KRAB  mobility  shift. 
FP,  free  probe. 


DNA-bound  GAL4-KRAB  protein.  Thus,  the  KAP- 
1  coiled-coil  region  may  provide  a  unique  oligo¬ 
merization  surface  required  for  KRAB  binding  or 
contribute  directly  to  KOX1-KRAB  domain  recog¬ 
nition  via  specific  contacts. 

The  studies  described  here  suggest  that  the 
RING  finger,  B  boxes,  and  coiled-coil  region  collec¬ 
tively  appear  to  constitute  a  tripartite  interface  for 
protein-protein  interactions  that  is  conserved  in  a 
number  of  cellular  proteins  including  TIFla,  TIFly, 
and  MIDI  (Figure  1).  To  more  precisely  address 
the  specificity  of  each  motif  in  this  apparent  pro¬ 
tein-protein  interaction  domain  of  KAP-1,  we  have 
substituted  the  RING  finger,  B  boxes,  and  coiled- 
coil  sub-domains  of  KAP-1  with  the  corresponding 
motif  from  MIDI  (Quaderi  et  al.,  1997) 
(Figure  1(d)).  We  expressed  and  purified  the 
MIDI -RBCC  domain  and  heterologous  KAP-1/ 


MID1-RBCC  fusion  proteins  from  bacteria  (data 
not  shown),  and  tested  their  ability  to  interact  with 
the  KRAB  domain.  In  general,  the  MID1-RBCC 
domain  and  each  of  the  KAP-1 /MID  RBCC  fusion 
proteins  failed  to  bind  to  the  DNA-bound  GAL4- 
KRAB  protein.  Flowever,  RMidbCCkap  fusion  pro¬ 
tein  showed  weak  potential  to  shift  the  DNA:- 
GAL4-KRAB  binary  complex  (Figures  1(d)  and  9). 

Together,  these  data  suggest  that  the  RBCC 
domain  of  KAP-1  functions  as  an  integrated  struc¬ 
tural  unit  to  independently  facilitate  the  oligomeri¬ 
zation  of  KAP-1  and  its  highly  specific  interaction 
with  the  KRAB  domain. 


Discussion 

The  RBCC  tripartite  domain  has  emerged  as 
a  highly  conserved  module  in  proteins  that  play 
causal  roles  in  cellular  differentiation  and  develop¬ 
ment,  and  whose  alteration  can  result  in  human 
disease  (El-Husseini  &  Vincent,  1999;  Ogawa  et  al, 
1998;  Perrin  &  Lacroix,  1998;  Tatematsu  et  al, 
1998).  The  spatial  conservation  of  sub-domains  in 
the  RBCC  motif  and  their  presence  in  large  cellular 
complexes  has  led  to  the  suggestion  that  the  RBCC 
domain  mediates  specific  protein-protein  inter¬ 
actions.  However,  these  targets,  even  for  the  most 
well-characterized  RBCC  proteins  have  eluded 
investigators. 

Here,  we  present  the  first  demonstration  that  a 
highly  specific,  physiologically  relevant  protein- 
protein  interaction  can  be  reconstituted  with  a  pur¬ 
ified  RBCC  domain  and  its  corresponding  binding 
partner,  thus  providing  proof  that  the  RBCC 
domain  is  a  protein-protein  binding  surface.  We 
can  draw  the  following  conclusions  from  our  data: 
(1)  the  interaction  is  direct;  it  does  not  apparently 
require  post-translational  modification  and  is  stable 
to  in  vitro  biochemical  manipulation.  (2)  A  stable 
ternary  complex  can  be  formed  between  DNA,  a 
DNA-bound  KRAB  domain  and  the  KAP-l-RBCC 
protein;  thus,  the  KRAB-KAP-1  interaction  does 
not  inhibit  DNA-binding  in  the  system  we  have 
used.  (3)  Recognition  of  the  KRAB  domain  by  the 
KAP-l-RBCC  is  highly  specific,  as  the  related 
RBCC  proteins  MIDI  and  TIFla  do  not  bind  under 
these  experimental  conditions.  (4)  Binding  requires 
structural  elements  of  all  the  sub-domains  of  the 
KAP-l-RBCC,  as  mutation  of  each  individual  sub- 
domain  has  the  potential  to  abolish  KRAB  domain 
binding.  (5)  Each  RBCC  sub-domain  may  co¬ 
operate  in  providing  specificity  for  interaction,  as 
individual  sub-domain  swaps  with  related  RBCC 
proteins  fail  to  recapitulate  binding.  (6)  The  KAP- 
l-RBCC  domain  must  properly  homo-oligomerize 
in  order  to  bind  the  KRAB  domain:  this  oligomeri¬ 
zation  state  is  probably  a  trimer  and  further  associ¬ 
ation  to  a  hexamer  may  occur.  (7)  Oligomerization 
is  necessary  but  not  sufficient  for  KRAB  domain 
binding,  implying  that  separate  functional  domains 
for  oligomerization  and  KRAB  recognition  exist. 
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(8)  The  coiled-coil  sub-domain  itself  provides 
critical  binding  and/or  determines  orientational 
specificity  for  KRAB  domain  binding,  as 
heterologous  oligomerization  domains  fail  to  reca¬ 
pitulate  binding.  The  picture  that  emerges  is  that 
the  KAP-l-RBCC  domain  forms  an  integrated  unit 
composed  of  essential  sub-domains  that  probably 
function  cooperatively  to  recognize  a  protein 
ligand  with  high  specificity. 

Although  it  has  been  demonstrated  that  many 
RBCC  proteins  oligomerize,  its  role  in  ligand  bind¬ 
ing  has  not  been  determined.  Our  data  with  the 
KRAB-KAP-1  system  firmly  establish  that  oligo¬ 
merization  is  required  for  binding.  A  major  issue  is 
the  oligomerization  of  KAP-1,  before  and  after 
KRAB  binding  and  whether  binding  stabilizes  the 
oligomeric  species.  We  favor  the  interpretation  that 
the  KRAB-bound  KAP-l-RBCC  domain  is  a  trimer 
or  hexamer  based  upon  a  number  of  observations. 
Analysis  of  the  predicted  coiled-coil  region  using 
the  multi-coils  program  predicts  the  potential  for 
two  independent  coiled  regions  with  the  highest 
probability  that  each  forms  a  trimer.  Gel  filtration 
of  soluble  E.  coli  KAP-l-RBCC  in  the  presence  or 
absence  of  the  KRAB  domain  shows  a  complex  of 
apparent  molecular  mass  three  times  greater  than 
the  monomeric  molecular  mass.  Chemical  cross- 
linking  in  solution  shows  apparent  trimeric  and 
hexameric  forms  of  the  KAP-l-RBCC  as  the  predo¬ 
minant  species,  and  the  production  of  the  trimeric 
cross-linkable  form  is  greatly  enhanced  by  pre¬ 
incubation  with  recombinant  KRAB  domain. 
Analytical  ultracentrifugation  experiments  show 
mixtures  of  monomers,  trimers,  and  hexamers. 
Together,  these  observations  are  consistent  with 
the  potential  for  a  monomer-trimer-hexamer  equili¬ 
brium.  The  KAP-l-RBCC  protein  also  sedimented 
as  an  apparent  monomer  in  standard  velocity  sedi¬ 
mentation  in  sucrose  gradients,  suggesting  that 
oligomers  were  not  stable  under  these  conditions. 
However,  pre-incubation  of  the  KAP-l-RBCC  with 
the  KRAB  domain  resulted  in  a  sucrose  gradient 
sedimentation  profile  of  the  complex  consistent 
with  a  trimer.  The  shift  from  apparent  monomer  to 
apparent  trimer  in  sedimentation  is  unlikely  to  be 
accounted  for  by  simple  incorporation  of  multiple 
molecules  of  the  KRAB  domain  into  the  complex, 
since  preliminary  stoichiometry  estimates  suggest 
that  the  KAP-1 -RBCC: KRAB  complex  is  approxi¬ 
mately  3:1  (unpublished  results).  In  summary,  the 
behavior  of  the  complex  in  these  assays  most  likely 
indicates  KRAB-induced  trimer  formation  or  a  con¬ 
formational  change  detected  by  sedimentation. 

We  confirmed  these  studies  using  KAP-l-RBCC 
and  full-length  KAP-1  protein  expressed  in  eukary¬ 
otic  cells.  The  co-infection  experiments  clearly 
show  that  hetero-oligomers  can  be  efficiently 
formed  between  full-length  KAP-1  and  KAP-l- 
RBCC,  and  that  the  hetero-oligomers  are  functional 
for  KRAB  binding.  Most  importantly,  three  inter¬ 
mediate  supershifts  were  observed.  This  strongly 
suggests  that  the  active  binding  species  is  oligo¬ 
meric,  most  likely  a  trimer  or  hexamer. 


The  mutations  in  either  the  RING  finger 
(CC65,68AA)  or  the  coiled-coil  region  (L270P) 
when  expressed  individually  in  Sf9  cells  were  inac¬ 
tive  in  KRAB  domain  binding.  Thus,  either  they 
are  defective  in  homo-oligomerization  or  form  an 
oligomer  unproductive  for  KRAB  binding.  Both  of 
these  mutant  proteins  show  slightly  smaller  appar¬ 
ent  molecular  masses  in  gel  filtration,  suggesting 
an  unfolded  state.  This  apparent  monomeric 
species  cannot  be  converted  to  the  apparent  oligo¬ 
meric  species  (as  detected  by  sucrose  gradient  sedi¬ 
mentation  or  cross-linking)  via  incubation  with  the 
KRAB  domain.  Thus,  the  apparent  KRAB-induced 
oligomerization  is  required  for  binding  and  both 
the  RING  finger  and  the  first  coiled-coil  are 
important  for  this  oligomerization. 

However,  it  is  puzzling  that  these  mutants, 
when  co-expressed  with  full-length  KAP-1  in  bacu- 
lovirus-infected  cells  efficiently  form  a  hetero-oligo- 
meric  complex.  Clearly,  these  hetero-oligomeric 
complexes  are  not  competent  for  KRAB  domain 
binding,  since  there  were  no  intermediate  super¬ 
shift  bands  observed  from  a  mutant /wild-type 
co-infected  extract.  Thus,  either  the  binding  is  non¬ 
specific,  or  the  full-length  KAP-1  protein  compen¬ 
sates  for  an  oligomerization  defect  in  the  mutant 
RBCC  protein. 

Hetero-oligomers  that  contain  one  mutant  sub¬ 
unit  are  apparently  defective  in  KRAB  binding  and 
thus  these  hetero-oligomers  appear  to  act  as  true 
dominant-negatives.  This  behavior  has  been 
observed  in  many  other  oligomeric  proteins,  where 
functionally-separate  oligomerization  and  ligand 
binding  domains  are  present  (Hurst,  1995; 
Baxevanis  &  Vinson,  1993).  An  important  point  is 
that  the  RING  finger  mutant  (CC65,68AA)  fails  to 
form  a  productive  homo-oligomer  or  hetero-oligo¬ 
mer  with  wild-type  KAP-1,  even  though  it  contains 
an  intact  coiled-coil  region.  This  observation  clearly 
demonstrates  that  the  RING  finger  participates 
both  in  oligomerization  and  ligand  recognition. 

Our  mutational  analysis  of  the  individual  RBCC 
sub-domains  clearly  shows  that  the  core  structural 
integrity  of  the  RING  finger  appears  to  be  required 
for  KRAB  domain  binding.  Many  engineered  and 
naturally  occurring  RING  finger  mutations  have 
been  studied  in  other  proteins  but  none  has  yet 
been  correlated  with  alteration  of  a  specific  RING- 
protein  interaction  in  a  purified  system.  Homo  or 
hetero-oligomerization  via  RING-RING  interaction 
has  been  observed  in  PML,  BRCA1,  Bmi-1,  Mph2 
(Borden  et  al. ,  1995a;  Hemenway  et  al,  1998;  Meza 
et  al,  1999).  The  RING  finger  mutant  BRCA1  pro¬ 
teins  show  altered  oligomerization,  subcellular 
localization,  interaction  with  protein  partners,  and 
ability  to  suppress  cell  growth  (reviewed  by 
Irminger-Finger  et  al. ,  1999).  RING  finger 
mutations  in  the  proto-oncogene  PML  have  pro¬ 
found  effects  on  the  localization  of  the  protein  to 
the  ND10  nuclear  bodies  (Borden  et  ah,  1995a), 
which  have  been  correlated  with  effects  on  both 
cell  survival  mediated  by  wild-type  PML  (Borden 
et  al.,  1997)  and  oncogenesis  as  mediated  by  the 
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PML-RARa  fusion  (Fagioli  et  ah,  1998).  Finally,  the 
RING  finger  of  RFP  is  critical  to  its  cell  transform¬ 
ation  function  as  the  rfp-ret  fusion  oncogene  (Cao 
et  ah,  1997, 1998;  Hasegawa  et  ah,  1996). 

The  mutational  analysis  of  the  B1  and  B2  boxes 
of  KAP-1  has  also  been  informative.  Interestingly, 
the  triple-cysteine  substitution  in  the  NH2-terminal 
B1  box  had  no  effect  on  KRAB  domain  binding, 
whereas  a  similar  substitution  in  the  analogous 
cysteine  residues  in  the  B2  box  completely  abol¬ 
ished  KRAB  domain  binding.  If  these  substitutions 
abolished  the  structural  integrity  of  both  B  boxes, 
it  is  reasonable  to  conclude  that  there  is  an  unequal 
contribution  of  each  B-box  to  KRAB  domain  bind¬ 
ing  and/or  KAP-1  oligomerization.  The  evidence 
that  the  B-box  is  required  for  the  proper  orientation 
of  the  coiled-coil  region  (Cao  et  ah,  1997)  could 
explain  why  mutation  of  the  B2  box  adjacent  to  the 
coiled-coil  in  KAP-1  abolished  KRAB  binding, 
whereas  mutation  of  the  distal  B1  box  had  no 
effect. 

Oligomerization  mediated  by  the  coiled-coil 
region  is  one  of  the  most  important  determinants 
of  the  biological  properties  of  RBCC  proteins, 
and  is  likely  responsible  for  their  legendary 
potential  for  aggregation  and  multimerization 
in  vitro.  The  L270P  substitution  in  the  KAP-1 - 
RBCC  expressed  in  E.  coli  showed  aberrant  sedi¬ 
mentation  and  failure  to  oligomerize  in  the 
chemical  cross-linking  assay  (in  the  absence  of 
the  KRAB  domain).  These  properties  correlated 
with  an  impaired  ability  to  bind  GAL4-KRAB, 
although  significant  binding  was  detected  at 
very  high  levels  of  input  KAP-l-RBCC  (L270P). 
However,  baculovirus-expressed  KAP-l-RBCC 
(L270P)  was  completely  inactive  for  GAL4-KRAB 
binding,  even  at  very  high  concentrations.  Sur¬ 
prisingly,  the  bv. KAP-l-RBCC  (L270P)  still  effi¬ 
ciently  bound  to  wild-type  full-length  bv.KAP-1 
in  co-infected  cells.  However,  these  hetero-oligo- 
meric  complexes  were  not  able  to  bind  the 
KRAB  domain,  thus  exhibiting  an  apparent 
dominant-negative  activity  essentially  identical 
with  the  KAP-l-RBCC,  RING  finger  mutant 
(CC65,68AA)  discussed  above.  Like  the  RING 
finger  mutant,  we  see  a  difference  in  homo-oli- 
gomeric  versus  hetero-oligomeric  potential  with 
wild-type  KAP-1  in  the  KAP-l-RBCC  (L270P) 
mutant.  The  L306P  substitution  in  coil  II  is 
clearly  different.  The  E.  coli- produced  L306P  pro¬ 
tein  is  completely  blocked  in  its  ability  to  bind 
the  KRAB  domain,  even  at  high  concentration. 
Thus,  either  the  L270  and  L306  leucine  residues 
that  we  targeted  are  not  equivalent  in  terms  of 
their  contribution  to  each  coiled-coil  structure,  or 
coil  I  and  coil  II  are  not  equivalent  in  their  con¬ 
tribution  to  either  oligomerization  or  KRAB 
domain  recognition.  It  is  a  distinct  possibility 
that  one  (or  both)  coils  participate  directly  in 
KRAB  domain  recognition.  In  summary,  our 
mutational  analysis  for  the  KAP-l-RBCC  sub- 
domains  strongly  suggests  that  each  sub-domain 
contributes  to  KRAB  domain  binding,  either  as  a 


direct  recognition  element  or  indirectly  by  med¬ 
iating  appropriate  homo-oligomerization. 

To  begin  distinguishing  these  possibilities,  we 
constructed  a  number  of  fusion  proteins  containing 
heterologous  oligomerization  domains  and 
"swapped"  RBCC  sub-domains  from  KAP-1  ortho- 
logues.  Our  strategy  of  replacing  the  coiled-coil 
region  of  KAP-1  with  heterologous  oligomerization 
domains  was  based  on  the  following  plausible 
model  for  the  KAP-l-RBCC  oligomer:  that  the 
coiled-coil  region  forms  a  parallel  trimeric  associ¬ 
ation  of  helices  that  orient  the  RING  finger  and  B- 
boxes  for  KRAB  binding.  If  the  coiled-coil  serves 
only  as  an  oligomerization  motif,  it  would  be  pre¬ 
dicted  that  it  could  be  substituted  for  by  a  similar 
motif,  as  has  been  shown  in  other  oligomeric  pro¬ 
teins.  The  best  evidence  that  supports  such  a 
potential  model  comes  from  recent  structural  ana¬ 
lyses  of  the  TRAF  proteins  (McWhirter  et  ah,  1999; 
Park  et  ah,  1999),  which  encode  RING  finger  and 
coiled-coil  regions  and  are  a  subfamily  of  the 
RBCC  proteins.  The  TRAFs  homo  and  hetero-tri- 
merize  via  a  parallel  association  of  the  coiled-coil 
region.  Trimerization  properly  orients  the  TRAF 
domain  for  binding  to  the  cytoplasmic  receptor, 
and  this  trimerization  function  can  be  substituted 
by  heterologous  trimerization  domains  (McWhirter 
et  ah,  1999;  Park  et  ah,  1999).  However,  KAP-l-RB 
proteins  fused  to  heterologous  dimerization,  tri¬ 
merization,  or  tetramerization  domains  completely 
failed  to  bind  the  KRAB  domain.  Our  alternative 
approach,  using  domain  swaps  from  the  MID1- 
RBCC  protein  yielded  similar,  uniformly  negative 
results.  That  is,  substitution  of  the  RING  finger, 
B-box-coiled-coil,  or  coiled-coil  region  alone  from 
MIDI  for  the  corresponding  region  in  KAP-1  failed 
to  recapitulate  KRAB  domain  binding.  Together, 
these  results  with  chimeric  RBCC  proteins  strongly 
suggest  that  the  RBCC  domain  of  KAP-1  binds  as 
an  integrated  functional  unit  to  its  ligand,  and  that 
each  sub-domain  contributes  to  protein  recognition 
and/or  oligomerization.  It  is  noteworthy  that  the 
intact  RBCC  domains  from  three  independent 
RBCC  proteins,  TIFloc,  MIDI  and  TIFly  (Venturini 
et  ah,  1999)  all  fail  to  bind  the  KRAB  domain. 
These  observations  underscore  the  apparent  high 
degree  of  specificity  for  ligand  recognition  that  is 
built  into  the  RBCC  domain  of  KAP-1  and  prob¬ 
ably  other  RBCC  proteins. 

An  important  point  is  whether  endogenous 
RBCC  proteins  form  hetero-oligomers  via  RBCC- 
RBCC  interactions.  These  interactions  would  serve 
to  increase  the  combinatorial  complexity  for  target 
recognition  by  hetero-oligomeric  RBCC  complexes. 
It  is  interesting  to  note  that  apparent  hetero-trimer- 
ization  has  been  observed  among  certain  members 
of  the  TRAF  subfamily  of  RBCC  proteins,  although 
it  is  unclear  whether  this  alters  the  biological  sig¬ 
naling  properties/specificities  of  these  molecules 
(Pullen  et  ah,  1999).  Even  among  the  clearly  paralo- 
gous  TIF1  family  of  RBCC  proteins,  only  indirect 
weak  binding  between  family  members  has  been 
detected  (Venturini  et  ah,  1999).  Rather  remarkably. 
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the  MIDI  and  MID2  RBCC  proteins,  which  show 
overall  77%  amino  acid  sequence  identity,  fail  to 
hetero-dimerize  in  vivo  (Buchner  et  ah,  1999; 
Cainarca  et  al,  1999).  However,  there  is  indirect 
evidence  that  RFP  can  hetero-oligomerize  and  co¬ 
localize  with  PML,  and  that  this  activity  requires 
the  B  box  and  coiled-coil  regions  of  RFP  (Cao  et  ah, 
1998).  Clearly,  studies  with  purified  components  as 
demonstrated  here,  are  required  to  resolve  this 
important  issue. 

Our  primary  goal  in  this  study  was  to  begin  to 
establish  a  set  of  principles  for  oligomerization  and 
ligand  recognition  for  the  RBCC  family  of  proteins 
using  the  KRAB-KAP-1  system  as  a  model.  How¬ 
ever,  equally  important  are  the  implications  that 
our  findings  have  for  defining  the  biological  func¬ 
tions  of  KAP-1  as  a  corepressor  in  vivo.  The  KRAB 
domain  is  currently  represented  in  more  than  100 
proteins,  most  of  which  are  C2H2  zinc-finger  pro¬ 
teins  and  are  likely  to  function  as  gene-specific 
repressors.  Although  we  have  exclusively  utilized 
the  KRAB  domain  of  KOX1,  more  than  ten  inde¬ 
pendent  KRAB  domain  proteins  have  been  shown 
to  bind  KAP-1  and  thus  it  is  a  reasonable  assump¬ 
tion  that  the  vast  majority  of  this  family  will  utilize 
KAP-1  as  their  corepressor.  Our  current  working 
model  for  KRAB-ZFP-KAP-1 -mediated  repression 
is  illustrated  in  Figure  10.  A  KRAB-ZFP  binds 
DNA  sequence-specifically  through  its  array  of 
C2H2  zinc  fingers.  The  DNA-bound  KRAB  domain 
recruits  the  KAP-1  corepressor  via  a  direct  inter¬ 
action  with  the  RBCC  domain.  Binding  may  induce 
oligomerization  of  KAP-1,  which  is  required,  or 
KAP-1  may  be  pre-existing  in  an  oligomeric  state. 
Binding  of  KAP-1  by  the  KRAB-ZFP  in  proximity 
to  the  transcription  start  site  is  necessary  but  not 
sufficient  for  repression.  The  KAP-1  repression 
domains  comprised  of  the  HP1  binding  domain, 
the  PHD  finger  and  the  bromodomain,  which 


together  function  to  assemble  the  machinery 
required  for  repression.  A  major  goal  has  been  to 
define  this  machinery  "downstream"  of  KAP-1, 
which  is  required  and,  in  light  of  the  evidence  pre¬ 
sented  here,  whether  KAP-1  oligomerization  also 
plays  a  role  in  this  recruitment.  A  likely  direct  tar¬ 
get  of  KAP-1  in  this  context  was  recently  defined 
by  us  and  others  as  the  heterochromatin  protein 
HP1  (Ryan  et  ah,  1999).  The  HP1  proteins  are  a 
small  family  of  low  molecular  mass,  non-histone 
chromosomal  proteins,  some  of  which  are  tightly 
associated  with  silenced  heterochromatin.  We  have 
shown  that  KAP-1  directly  binds  to  purified 
recombinant  HP1  protein,  and  that  a  stable  qua¬ 
ternary  complex  can  be  formed  between  DNA,  a 
KRAB-ZFP,  KAP-1  and  HP1.  Mutations  in  KAP-1 
that  abolish  HP1  binding  significantly  impair 
KAP-l-mediated  repression.  Furthermore,  HP1  and 
KAP-1  extensively  colocalize  to  heterochro- 
matic  regions  in  interphase  nuclei.  Thus,  it  is 
tempting  to  speculate  that  KAP-1  serves  to  nucle¬ 
ate  HP1 -mediated  stable  repression  on  a  template, 
and  that  the  oligomerization  of  KAP-1  as  it  binds 
the  KRAB  domain  has  a  role  in  the  nucleation  of 
heterochromatin. 

In  summary,  we  have  provided  the  first  analysis 
of  a  direct  protein-protein  interaction  involving  an 
RBCC  protein  and  its  physiologically  relevant  pro¬ 
tein  ligand.  These  studies  have  highlighted  the 
apparently  high  specificity  for  recognition  inherent 
in  this  protein-protein  interface.  Further  refinement 
of  these  macromolecular  interactions  could  lead  to 
strategies  for  harnessing  KAP-l-mediated  repres¬ 
sion  in  vivo  using  synthetic  transcription  factors  for 
possible  therapeutic  benefit.  One  could  also  envi¬ 
sion  a  small-molecule  inhibitor  strategy  directed  at 
the  KRAB-KAP-1  system,  which  may  be  useful  for 
reprogramming  the  expression  of  stably  repressed 
genes  and  thus  altering  cellular  phenotypes. 


Organization  of 
Heterochromatin 


Figure  10.  Model  for  the  KRAB-KAP-1  interaction.  This  model  is  described  in  Discussion. 
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Materials  and  Methods 

Preparation  of  plasmids 

The  plasmid  expressing  the  6His  GAL4-KRAB  (1-90) 
protein  was  constructed  via  polymerase  chain  reaction 
(PCR)  using  the  pMl-KOXl,  1-90  plasmid  (Margolin 
et  al.f  1994)  as  a  template.  A  5'  oligonucleotide,  which 
incorporated  a  BamHl  site  immediately  5'  to  the  GAL4 
initiator  methionine  residue  and  a  3'  oligonucleotide, 
which  incorporated  a  stop  codon  and  a  Hzndlll  site  after 
amino  acid  residue  90  of  KOX1,  were  used  to  amplify 
the  desired  sequence.  The  digested  PCR  product  was 
cloned  into  the  corresponding  restriction  sites  of  the 
pQE30  vector  (Qiagen). 

The  plasmid  expressing  the  6His  KRAB  (1-90)  protein 
was  produced  by  subcloning  the  DNA  segment  encod¬ 
ing  residues  1-90  of  KOX1-KRAB  from  pMl  into  the 
pSP73  vector,  using  the  5'  EcoRI  and  3'  Xbal  sites.  This 
fragment  was  then  excised  and  subcloned  via  the  flank¬ 
ing  BamHl  and  Psfl  sites  in  pSP73  into  the  corresponding 
sites  of  the  pQE30  vector.  The  protein  expressed  from 
this  plasmid  contained  the  18  NH2-terminal  amino  acid 
residues,  MRGSHHHHHHGSDIIDEF,  followed  by  resi¬ 
dues  1-90  of  the  KOX1  protein,  then  the  C-terminal 
sequence  RVDLQPSLIS  is  present. 

The  pQE30  KAP-l-RBCC  plasmid  was  constructed  by 
amplifying  the  DNA  fragment  encoding  residues  22-418 
of  human  KAP-1  using  the  primer  pair,  forward  5'  CGT 
TGA  ATT  CTA  CCA  TGG  CTT  CGT  GGG  GAT  CCC  3', 
reverse  5'  CGT  AAA  GCT  TGC  CTG  TTG  AGT  TAG  TGC 
C  3'.  The  PCR  product  was  cleaved  with  EcoRI 
and  HmdIII,  and  cloned  into  pBluescript  II  KS  +/—  to 
generate  pBS-KS  KAP-l-RBCC.  This  fragment  was  then 
excised  with  the  flanking  5'  Smal  site  and  3'  HmdIII  sites 
in  pBluescript  II  KS  +/-  and  cloned  into  the  pQE30 
plasmid.  The  expressed  KAP-l-RBCC  protein  contained 
the  18  NH2- terminal  amino  acid  residues, 
MRGSHHHHHHGSACELGT,  followed  by  residues  22- 
418  of  the  KAP-1  protein,  then  the  C-terminal  sequence 
KLN.  The  point  mutations  within  the  RBCC  domain  of 
KAP-1  were  generated  in  this  plasmid  using  PCR- 
mediated  mutagenesis.  The  mutagenic  primers  contained 
the  following  codons:  (CC65,68AA),  TGC  to  GCC; 
(CCC152,153,156AAA),  TGC  to  GCC,  TGT  to  GCT; 
(C209A,H212A),  TGC  to  GCC,  CAC  to  GCC;  (L270P), 
TTG  to  CCG;  (L306P),  CTG  to  CCG;  (V260N),  GTG  to 
AAC;  (V260D),  GTG  to  GAC. 

A  baculovirus  vector  (pVL1392)  encoding  residues 
22-418  of  KAP-1  was  generated  by  first  digesting 
pBS-KS  KAP-l-RBCC  with  HmdIII.  The  overhang  in  the 
linearized  plasmid  was  filled  in  with  Klenow  enzyme, 
and  then  the  DNA  was  digested  with  EcoRI.  This 
fragment  was  cloned  into  pVL1392  (PharMingen)  at 
the  EcoRI  and  Smal  sites.  The  protein  expressed  con¬ 
tained  the  16  NH2-terminal  amino  acid  residues, 
MASWGSHHHHHHDPGY,  followed  by  residues  22-418 
of  the  KAP-1  protein,  then  the  C-terminal  sequence 
KLGSFPGTRQEPKTHSLQGNP. 

The  pQE30  TIFloc-RBCC  plasmid  was  produced  by 
amplifying  the  DNA  fragment  encoding  residues  1-414 
of  TTFloc  using  the  primer  pair,  forward  5'  GCA  TAG 
ATC  TAT  GGA  GGT  GGC  TGT  GGA  GAA  G  3';  reverse 
5'  CGA  TGC  ATG  CTC  AGA  GCC  CAG  AAA  CTA 
GGA  TC  3'.  The  PCR  product  was  digested  with  Bglll 
and  Sphl,  and  cloned  into  the  BamHl  and  Sphl  sites  of 
the  pQE30  vector. 

The  plasmids  expressing  the  KAP-1  RB  region  fused 
to  various  heterologous  oligomerization  domains  and 


KAP-1 /MID-1  domain  swaps  were  generated  by  over¬ 
lap-extension  PCR-mediated  mutagenesis.  The  GCN4-LZ 
and  GCN4-TZ  domains  have  been  described  and 
spanned  residues  253-281  and  250-281  of  GCN4,  respect¬ 
ively  (Waterman  et  ah ,  1996).  The  Saccharomyces  cerevisiae 
heat-shock  transcription  factor  (HSF)  trimerization 
domain  spanning  residues  321-441  was  used  (Drees  et  al, 
1997).  The  forward  primers  are:  KAP-RB-[HSFXri],  KAP- 
RB-[GCN4Dim],  KAP-RB-[GCN4Tet],  RBKAP-CCMID,  R^p- 
BCCmid,  5'  AGC  GGA  TCC  GAA  TTC  ACC  ACC  ATG 
GCT  CAC  CAT  CAC  CAC  CAT  CAC  3'  (BamHl);  MID- 
RBCC,  Rmid-BCC^p,  5'  GCA  TAG  ATC  TAT  GGA  AAC 
ACT  GGA  GTC  AG  3'  (Bglll).  The  reverse  primers  are: 
KAP-RB-[HSFTri],  5'  CGT  TAA  GCT  TTC  AAG  TGC 
CTG  TGT  TGT  CGC  G  3'  (HmdIII);  KAP-RB-[GCN4Dim], 
5'  CGT  TAA  GCT  TTC  AGC  GTT  CGC  CAA  CTA  A  3' 
(HmdIII);  KAP-RB-[GCN4Tet],  5'  CGT  TAA  GCT  TTC 
AAC  GTT  CAC  CCA  ATA  ATT  T  3'  (HmdIII);  MID- 
RBCC,  5'  CGA  TGC  ATG  CTC  AAT  CTA  AGG  CAA 
AGG  TGT  C  3'  (Sphl);  RBKAP-CCMID,  Rkap-BCCmid,  5' 
GAT  CAA  GCT  TTC  AAT  CTA  AGG  CAA  AGG  TGT  C 
3'  (HmdIII);  RMiD-BCCkap,  5'  ATG  CAA  GCT  TTC  ATG 
TTG  AGT  TAG  TGC  CAG  GAC  3'  (HmdIII).  Fusion  pri¬ 
mers  (forward)  are:  KAP-RB-[HSFXri],  5'  TGA  GGA  ACC 
AGC  GCA  AGA  ATG  GTC  ATT  TAT  TGC  AGG  3'; 
KAP-RB-[GCN4Dim],  5'  TGA  GGA  ACC  AGC  GCA  AGA 
TCC  GCG  GGC  GTC  AGC  TT  3';  KAP-RB- [GCN4Xet],  5' 
TGA  GGA  ACC  AGC  GCA  AGA  TCC  GCG  GTA  ATC 
GTC  TG  3';  RBKAP-CCMID,  5'  GTG  AGG  AAC  CAG  CGC 
AAG  AAG  CAA  AAC  TTA  GAG  AGT  AAC  3';  RKAP- 
BCCmid,  5'  AAT  TAT  TTC  ATG  CGT  GAT  AGT  GGG 
CTC  AAG  CGC  AAC  GTC  ACC  3';  RMrD-BCCKAP,  5' 
CTC  AGC  CAG  CGA  GGT  CTA  GAC  GGC  AGC  AAG 
GCT  GCC  ACC  GAC  3'.  The  second-round  PCR  pro¬ 
ducts  were  digested  with  the  appropriate  enzymes 
defined  for  each  fusion  in  parentheses  above  and  sub¬ 
cloned  into  compatible  restriction  sites  of  the  pQE30  vec¬ 
tor.  All  plasmids  generated  by  PCR  were  confirmed  by 
DNA  sequencing  on  both  strands  to  verify  the  presence 
of  desired  mutations  and  appropriate  fusions  (Wistar 
DNA  Core  Facility). 

Protein  purification 

Escherichia  coli  SG13009  cells  (Qiagen)  bearing  the 
desired  plasmid  were  propagated  with  aeration  at  37  °C 
in  one  liter  of  Luria  broth  to  an  A600  of  approximately 
0.6.  IPTG  was  added  to  1  mM,  and  growth  at  37  °C  was 
continued  for  four  hours.  The  cells  were  harvested  by 
centrifugation. 

The  purification  of  KOX1-KRAB  protein  was  per¬ 
formed  at  room  temperature  (Shi  et  al,  1997).  For  one 
liter  of  cell  culture,  the  cell  pellet  was  resuspended  in 
40  ml  of  P300  buffer  (10  mM  Na2HP04,  1.4  mM 
KH2P04,  2.7  mM  KC1,  450  mM  NaCl,  pH  7.0, 1  mM  phe- 
nylmethylsulfonyl  fluoride  (PMSF)).  The  cells  were  lysed 
by  sonication  on  ice  and  the  cell  extract  was  centrifuged 
at  12,000  g  for  20  minutes.  The  pellet  was  dissolved  in 
40  ml  of  lysis  buffer  (20  mM  Tris-HCl  (pH  7.8),  0.5  M 
NaCl,  5  mM  imidazole,  8  M  urea),  and  incubated  for  one 
hour.  The  lysate  was  then  centrifuged  at  12,000  g  for  20 
minutes  at  4°C.  To  the  supernatant,  1-1.5  ml  packed  col¬ 
umn  volume  of  Ni-NTA  resin  (Qiagen),  previously  equi¬ 
librated  in  lysis  buffer,  was  added  in  batch  and  the 
incubation  continued  with  rotation  for  one  hour.  The 
resin  was  then  washed  with  lysis  buffer  and  loaded  onto 
a  disposable  column,  followed  by  an  additional  volume 
of  wash  buffer  (20  mM  Tris-HCl  (pH  7.8),  0.5  M  NaCl, 
20  mM  imidazole,  8  M  urea).  The  KRAB  protein  bound 
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to  the  nickel  column  was  renatured  on  the  column  by 
stepping  the  urea  in  lysis  buffer  from  8  M  to  0  M.  The 
renatured  KRAB  protein  was  eluted  with  elution  buffer 
(20  mM  Tris-HCl  (pH  7.8),  0.5  M  NaCl,  0.3  M  imida¬ 
zole).  The  protein  was  dialyzed  against  20  mM  Tris-HCl 
(pH  8.0),  0.5  M  NaCl,  followed  by  a  second  dialysis 
reducing  the  salt  to  50  mM  NaCl. 

The  KAP-l-RBCC  proteins,  mutants  thereof,  and  the 
chimeras,  were  purified  using  both  denaturing  and 
non-denaturing  conditions.  The  purification  was  initially 
performed  at  4°C  under  non-denaturing  condition  as 
recommended  by  the  manufacturer  (Qiagen).  The  bac¬ 
terial  pellet  was  resuspended  in  P300  buffer  (pH  7.0), 
10%  (v/v)  glycerol,  10  mM  imidazole,  and  lysed  by 
sonication.  The  cell  extract  was  centrifuged  at  12,000  g 
for  20  minutes.  Hie  insoluble  pellet  was  then  used  for 
purification  under  denaturing  conditions  (see  below). 
The  supernatant  fraction  containing  soluble  protein  was 
incubated  in  batch  with  Ni-NTA  resin  for  one  hour.  The 
resin  was  washed  with  P300  buffer.  Then  the  resin  was 
loaded  into  a  disposable  column,  and  washed  with  P300 
buffer  at  pH  6.0.  The  protein  was  eluted  with  P300 
(pH  7.0),  10  %  glycerol,  300  mM  imidazole.  The  eluted 
protein  was  dialyzed  against  three  changes  of  P300 
(pH  7.0),  10%  glycerol,  20  pM  ZnS04,  0.5  mM  DTT). 

The  insoluble  proteins  from  the  pellet  fraction  were 
purified  and  eluted  from  the  Ni-NTA  with  300  mM  imi¬ 
dazole  and  low  pH  under  denaturing  condition  as  rec¬ 
ommended  by  the  manufacturer.  The  eluted  proteins 
were  diluted  to  20  ml  with  P300,  and  dialyzed  against 
P300  (pH  7.0),  10%  glycerol,  20  pM  ZnS04,  0.5  mM 
DTT.  After  dialysis,  the  protein  was  clarified  by  centrifu¬ 
gation  and  concentrated  in  an  Amicon  concentrator 
(Amicon)  following  the  manufacturer's  instruction. 


Electrophoretic  mobility  shift  assay  (EMSA) 

EMSA  assays  were  performed  essentially  as  described 
(Ryan  et  al,  1999;  Fredericks  et  ah,  1995).  The  purified 
recombinant  GAL4-KRAB  or  GAL4-KRAB  (DV-AA)  pro¬ 
tein  was  incubated  in  binding  buffer  with  either  purified 
bacterial  KAP-l-RBCC,  KAP-l-RBCC  mutant/fusion  pro¬ 
teins,  or  baculovirus  whole-cell  lysates  for  15  minutes  at 
30  °C,  and  then  1  pi  of  32P-labeled  GAL4  probe  (  105 
cpm/pl)  was  added  and  the  reaction  was  incubated  for 
an  additional  15  minutes  at  30  °C.  The  DNA:protein 
complexes  were  resolved  on  native  polyacrylamide  gels 
by  electrophoresis  at  450  V  for  one  hour  and  45  minutes 
in  45  mM  Tris-borate  (pH  8.3),  1  mM  EDTA  buffer  at 
4°C.  The  EMSA  gels  were  dried  and  visualized  by  auto¬ 
radiography.  The  GAL4  was  the  double-stranded  syn¬ 
thetic  oligonucleotide  5'  GAT  CCC  GGA  GGA  CAG 
TAC  TCC  GT  3',  which  was  labeled  with  [32P]ATP  as 
described  (Fredericks  et  al ,  1995). 


Reconstitution  of  the  KOX1 -KRAB: KAP-l-RBCC 
complex  in  vitro 

To  reconstitute  the  KOXl-KRAB:KAP-l-RBCC  com¬ 
plex  in  vitro ,  highly  purified  bacterial  KOX1-KRAB  and 
KAP-l-RBCC  proteins  were  incubated  at  a  1:1  molar 
ratio  in  buffer  containing  20  mM  Hepes,  300  mM  NaCl, 
0.5  mM  DTT,  10%  glycerol  at  30  °C  for  30  minutes.  The 
samples  were  chilled  at  4°C  for  five  minutes,  and  then 
centrifuged  at  10,000  g  for  five  minutes  at  4°C.  The 
supernatant  was  analyzed  by  gel-filtration,  sucrose  gra¬ 
dient  sedimentation,  and  chemical  cross-linking. 


Gel-filtration  analysis 

Gel-filtration  of  highly  purified  proteins  was  per¬ 
formed  using  a  Superdex  200  HR  10/30  column  (Phar¬ 
macia)  equilibrated  in  P300  buffer  (pH  7.0),  10% 
glycerol,  0.1  %  NP-40,  20  pM  ZnS04,  1  mM  DTT,  using 
an  FPLC  system  (Pharmacia).  The  column  was  run  at 
4°C  at  a  flow-rate  of  0.5  ml /minute  and  1  ml  fractions 
were  collected.  Aliquots  of  each  fraction  were  analyzed 
by  SDS-PAGE  or  Western  blot  (Ryan  et  al,  1999). 

Sucrose  gradient  sedimentation 

Purified  bacterial  KAP-l-RBCC  protein  (60  pg),  KOX1- 
KRAB  (60  pg)  or,  KOX1-KRAB  (60  pg):KAP-l-RBCC 
(200  pg)  complex  in  200  pi  of  sucrose  gradient  buffer 
(10  mM  sodium  phosphate  (pH  7.0),  140  mM  NaCl, 
0.5  %  NP-40,  1  mM  PMSF,  2  pg/ml  aprotinin,  3  pg/ml 
leupeptin)  was  layered  onto  a  discontinuous  5  %  to  60  % 
(w/v)  sucrose  gradient.  The  gradients  were  pre-formed 
at  room  temperature  for  one  hour,  followed  by  an 
additional  hour  at  4  °C  prior  to  loading  the  sample.  The 
sucrose  gradients  were  centrifuged  for  16  hours  at  4°C 
in  an  SW-50.1  rotor  at  190,000  g.  Approximately  20  frac¬ 
tions  (250  pi  each)  were  collected  from  the  base  of  each 
centrifuge  tube.  Aliquots  of  each  fraction  were  subjected 
to  SDS-PAGE,  and  the  proteins  were  stained  with 
Coomassie  brilliant  blue. 

Non-denaturing  (ND)-PAGE 

ND-PAGE  was  conducted  by  the  procedure  described 
in  the  ND  protein  Molecular  Weight  Determination  Kit 
from  Sigma.  Purified  bacterial  KAP-l-RBCC  protein 
(10  pg)  and  10  pg  of  each  protein  standard  were  ana¬ 
lyzed  by  ND-PAGE  on  gels  of  various  polyacrylamide 
concentrations  (6,  7,  8,  and  10  %),  and  the  proteins  were 
stained  with  Coomassie  brilliant  blue. 

Analytical  ultracentrifugation 

The  KAP-l-RBCC  (0.7  to  2.0  mg/ml)  sample  was  pre¬ 
pared  for  analytical  ultracentrifugation  by  dialysis 
against  a  P300  buffer  (pH  7.0),  10%  glycerol,  20  pM 
ZnS04,  1  mM  DTT.  The  sedimentation  equilibrium 
experiments  were  performed  in  an  Optima  XL-I  ultracen¬ 
trifuge,  using  the  interference  optics  to  determine  the 
protein  concentration  gradient.  For  each  experiment, 
three  cells  were  assembled  with  double-sector  12  mm 
centerpieces  and  sapphire  windows,  and  loaded  with 
110  pi  of  reference  buffer  (dialysate)  and  110  pi  of 
sample,  at  three  different  protein  concentrations.  A  blank 
scan  of  distilled  water  was  taken  before  the  run,  to  cor¬ 
rect  for  the  effects  of  window  distortion  on  the  fringe 
displacement  data  (Yphantis,  1964).  The  experiments 
were  performed  at  4-25  °C,  at  a  speed  of  13,000  rpm. 
Fringe  displacement  data  were  collected  every  six  to 
eight  hours  until  equilibrium  was  reached,  as  deter¬ 
mined  by  comparison  of  successive  scans  using  the 
MATCH  program,  and  the  data  were  edited  using  the 
REEDIT  program  (both  programs  kindly  provided  by 
David  Yphantis). 

Analysis  of  sedimentation  equilibrium  data  was  per¬ 
formed  using  the  NONLIN  program  (Johnson  et  al. , 
1981).  The  partial  specific  volume  of  KAP-l-RBCC  was 
calculated  from  the  amino  acid  sequence,  and  the  density 
of  the  solvent  was  calculated  according  to  Laue  et  al 
(1992).  Three  data  sets  from  different  loading  concen- 
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trations  were  fitted  simultaneously.  Goodness  of  fit  was 
determined  by  examination  of  the  residuals  and  mini¬ 
mization  of  the  variance. 

Chemical  cross-linking 

Chemical  cross-linking  of  recombinant  proteins  using 
ethylene  glycol  bis  (succinimidylsuccinate)  (EGS)  (Pierce) 
was  performed  essentially  as  described  (Drees  et  ah, 
1997)  with  the  following  modifications.  For  analysis  of 
the  oligomerization  of  wild-type  or  mutant  KAP-l-RBCC 
proteins,  or  KOXl-KRAB:KAP-l-RBCC  complexes,  the 
final  protein  concentration  was  5  \iM  in  each  reaction. 
The  EGS  cross-linking  was  performed  at  room  tempera¬ 
ture  for  five  minutes  in  P300  buffer  (pH  7.0),  10  %  glycer¬ 
ol,  1  mM  DTT.  The  EGS  was  freshly  prepared  in 
dimethylsulfoxide  and  was  added  from  a  stock  solution 
to  final  concentrations  ranging  from  0.078  to  5  mM.  The 
reactions  were  terminated  by  addition  of  lysine  to  a  final 
concentration  of  20  mM.  After  cross-linking,  10  gl  of  5x 
Laemmli  sample  buffer  was  added  to  the  reactions,  and 
the  samples  were  analyzed  by  SDS-PAGE.  The  proteins 
were  detected  by  Western  blot  analysis  as  described 
(Ryan  et  al,  1999). 

Expression  of  KAP-1  and  KAP-l-RBCC  in 
baculovirus-infected  cells 

The  KAP-l-RBCC  proteins  and  the  full-length  KAP-1 
protein  were  expressed  individually  or  co-expressed  in 
baculovirus-infected  Sf9  cells  (Wistar  Core  Protein 
Expression  Facility).  High-titer  baculovirus  stocks  for 
each  protein  were  prepared  using  standard  techniques 
and  Sf9  cells  were  infected  for  48  hours.  For  a  50  ml  cell 
culture,  the  cell  pellets  were  lysed  in  5  ml  of  10  mM 
Tris-HCl  (pH  8.0),  150  mM  NaCl,  1  %  NP-40,  1  mM  ben- 
zamidine,  1  mM  PMSF,  2  mM  leupeptin,  10  jig /ml  pep- 
statin,  10  gg/ml  aprotinin,  and  10  mM  NaF.  The  cells 
were  lysed  at  4  °C  for  20  minutes,  followed  by  centrifu¬ 
gation  at  16,000  g  for  ten  minutes.  The  protein  was  then 
quantified  by  Bradford  assay  (Bio  Rad).  For  protein-pro¬ 
tein  association  assays,  the  supernatant  fraction  (250  |ig 
protein)  of  an  individual  protein  preparation  was  added 
to  50  gl  of  50%  (w/v)  Ni-NTA  resin  slurry,  and  incu¬ 
bated  at  4°C  for  one  hour.  The  resin  was  washed  with 
P300  (pH  7.0),  0.1  %  NP-40,  1  mM  PMSF.  The  bound 
proteins  were  extracted  from  the  resin  with  5  x  Laemmli 
sample  buffer,  resolved  by  SDS-PAGE,  and  detected  by 
Western  blot  (Ryan  et  al. ,  1999). 
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Abstract 

The  PHD  finger  and  bromodomain  are  two  highly  conserved  protein  motifs  that  are 
spatially  linked  in  a  growing  family  of  transcriptional  regulatory  proteins.  Optimal 
transcriptional  repression  by  KAP-1,  a  universal  corepressor  for  the  KRAB-ZFP  superfamily  of 
transcriptional  repressors,  requires  the  PHD  finger  and  bromodomain  at  the  COOH-terminus, 
which  form  an  integrated  functional  unit.  Mutations  in  the  KAP- 1  PHD  finger  that  mimic 
naturally  occurring  mutations  in  the  PHD  finger  of  hATRX  or  site-directed  mutants  in  the 
bromodomain,  based  on  the  structure,  specifically  disrupted  the  repression  function  of  this 
domain.  Substitution  of  highly  related  PHD  fingers  and  bromodomains  of  other  transcriptional 
regulators  failed  to  functionally  complement  this  repression  activity.  A  yeast  two-hybrid  screen 
revealed  a  specific  interaction  between  this  bipartite  domain  of  KAP-1  and  the  Mi-2a  subunit  of 
the  HD  AC  complex  NuRD.  These  data  suggest  that  the  PHD  finger  and  bromodomain  function 
cooperatively  as  an  integrated  functional  unit  that  provides  an  interface  for  protein-protein 
interactions.  Moreover,  the  KRAB-ZFP:KAP-1  repression  complex  may  function  to  target  the 
histone  deacetylase  and  chromatin  remodeling  activities  of  the  recently  defined  NuRD  complex 
to  specific  gene  promoters  in  vivo. 
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Introduction 

Regulation  of  RNA  polymerase  II  involves  a  complex  interplay  between  DNA-protein 
interactions  and  protein-protein  interactions.  While  the  general  transcription  factors  regulate  the 
accurate  initiation  of  transcription,  it  is  the  proteins  that  bind  gene-specific  DNA  elements  that 
are  instrumental  in  either  negatively  or  positively  regulating  the  rate  of  transcription  (Hampsey 
and  Reinberg,  1999;  Orphanides  et  al.,  1996).  Studies  aimed  at  understanding  the  mechanisms 
of  transcriptional  repression  have  been  greatly  aided  by  the  realization  that  the  domains  which 
mediate  repression  are  often  highly  conserved  amino-acid  sequence  motifs  which  occur  in  one  or 
more  families  of  proteins  with  common  DNA  binding  domains. 

The  KRAB  (Krtippel  associated  box)  domain  is  one  example  of  an  abundant  amino  acid 
sequence  motif  found  at  the  NH2-terminus  of  nearly  one-third  of  all  Kruppel/TFIIIA  type  C2H2 
zinc  finger  proteins  (Bellefroid  et  al.,  1991).  This  highly  conserved  domain  displays  potent, 
DNA-binding  dependent  repression  of  transcription  that  requires  the  KAP-1  corepressor 
(Friedman  et  al.,  1996;  Kim  et  al.,  1996;  Margolin  et  al.,  1994;  Moosmann  et  al.,  1996; 
Moosmann  et  al.,  1997).  The  primary  amino  acid  sequence  of  KAP-1  revealed  the  presence  of 
several  conserved  signature  motifs,  including  a  RING  finger,  B  boxes,  a  coiled-coil  region 
(RBCC),  which  collectively  form  an  integrated  domain  that  is  both  necessary  and  sufficient  to 
directly  interact  with  the  KRAB  domain  (Peng  et  al.,  2000;  Peng  et  al.,  2000).  The  COOH- 
terminal  sequence  of  KAP-1  revealed  a  conserved  PHD  finger  and  bromodomain.  This 
particular  spatial  arrangement  of  motifs  has  defined  an  emerging  family  of  transcriptional 
regulators  which  include  TIFla  and  TIFly  (Le  Douarin  et  al.,  1995;  Venturini  et  al.,  1999).  All 
members  of  this  protein  family  have  been  shown  to  repress  transcription  when  tethered  to  DNA, 
and  the  mechanisms  by  which  they  repress  transcription  are  currently  being  defined.  In  this 
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aspect  a  direct  interaction  between  KAP-1  and  mammalian  members  of  the  non-histone 
chromosomal  protein,  heterochromatin  protein  1  (HP1)  has  been  described  (Le  Douarin  et  al., 
1996;  Nielsen  et  al.,  1999;  Ryan  et  al.,  1999).  Biochemical  studies  indicate  that  this  interaction 
is  dependent  upon  the  chromoshadow  domain  of  HP1  proteins  and  a  core,  hydrophobic 
pentapeptide  sequence  (PxVxL)  in  KAP-1  (Lechner  et  al.,  2000).  Furthermore,  indirect 
immunofluoresence  studies  of  interphase  nuclei  indicated  that  KAP-1  colocalizes  with  both 
heterochromatic  and  euchromatic  HP1  proteins  (Ryan  et  al.,  1999).  These  particular  studies 
suggest  that  the  KRAB: KAP-1  complex  may  mediate  repression  in  part  via  the  maintenance  or 
initiation  of  heterochromatic  chromosomal  environments. 

The  PHD  finger  and  bromodomain  are  two  highly  conserved  protein  motifs  found  in  an 
increasing  number  of  proteins  with  roles  in  regulating  transcription  via  modification  of 
chromatin  structure.  Furthermore,  these  two  motifs  are  commonly  linked  architecturally  in  a 
number  of  proteins  (Bochar  et  al.,  2000;  Friedman  et  al.,  1996;  Jones  et  al.,  2000;  Jones  et  al., 
2000;  Le  Douarin  et  al.,  1996;  Le  Douarin  et  al.,  1995;  Venturini  et  al.,  1999).  Although  both 
motifs  are  abundant,  the  function  of  these  two  motifs  in  the  regulation  of  transcription  remains  to 
be  fully  described.  The  PHD  finger  (Plant  Homeodomain)  is  a  relatively  small  domain  of 
approximately  60  amino  acids  defined  by  a  consensus  amino  acid  signature  of  C4HC3,  which  is 
presumed  to  chelate  zinc  (Aasland  et  al.,  1995).  The  biological  importance  of  the  PHD  finger  in 
protein  function  is  underscored  by  the  identification  of  naturally  occurring  germline  mutations 
resulting  in  single  amino  acid  substitutions  in  the  PHD  finger  of  the  hATRX  protein,  which 
predispose  individuals  to  a-thalassemia  and  a  variety  of  developmental  defects  (Gibbons  et  al.. 


1997). 
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The  bromodomain  is  another  conserved  signature  motif  of  approximately  100  amino 
acids  that  is  found  in  30-40  independentproteins  (Jeanmougin  et  al.,  1997).  The  solution 
structure  of  the  bromodomain  of  pCAF  indicates  that  these  amino  acids  fold  into  four 
amphipathic  cc-helices,  and  that  acetyl-lysine  groups  of  histone  tails  may  be  a  natural  ligand  for 
this  structural  motif  (Dhalluin  et  al.,  1999).  In  this  aspect,  synthetically  derived  mutations  in  the 
bromodomain  of  the  yeast  proteins  RSC1  and  GCN5  were  observed  to  impair  the  chromatin 
related  functions  of  these  proteins  in  vitro  and  in  vivo,  without  effecting  the  ability  of  these 
proteins  to  assemble  into  multiprotein  complexes  (Cairns  et  al.,  1999;  Sterner  et  al.,  1999; 
Syntichaki  et  al.,  2000).  These  data  suggest  that  the  bromodomain  is  involved  in  modulating  the 
activity  of  these  transcriptional  regulatory  complexes. 

Our  data  from  the  analysis  of  the  PHD  finger  and  bromodomain  of  KAP- 1  suggests  that 
the  spatial  conservation  of  these  motifs  is  essential  for  optimal  protein  function.  These  data  are 
consistent  with  the  PHD  finger  and  bromodomain  providing  an  interface  for  protein-protein 
interactions,  and  therefore  may  have  broad  implications  in  understanding  the  function  of  proteins 
that  maintain  this  particular  spatial  arrangement  of  motifs.  Comparative  analysis  of  proteins 
containing  these  two  motifs  indicates  that  this  bipartite  domain  is  not  functionally  equivalent 
among  all  proteins.  Thus,  it  is  likely  that  the  function  of  this  integrated  domain  will  be  dependent 
upon  those  proteins  with  which  it  associates.  The  interaction  of  KAP-1  with  Mi-2a/CHD3,  is 
consistent  with  this  hypothesis  and  implies  a  role  for  histone  deacetylases  in  transcriptional 
repression  mediated  by  the  KRAB-ZFP:KAP-1  repressor/corepressor  system. 
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Results 

Several  pieces  of  evidence  indicate  that  KAP- 1  functions  as  a  transcriptional  corepressor 
for  the  KRAB  repression  module,  including  an  intrinsic  ability  to  repress  transcription  when 
directly  tethered  to  DNA  templates  (Agata  et  al.,  1999;  Friedman  et  al.,  1996;  Moosmann  et  al., 
1996;  Nielsen  et  al.,  1999;  Ryan  et  al.,  1999).  To  identify  regions  of  KAP-1  which  are  required 
for  its  transcriptional  repression  function  we  have  engineered  a  series  of  KAP- 1  deletion  mutants 
with  the  heterologous  GAL4  DNA  binding  domain  to  grossly  map  repression  domains  (Figure 
1).  Each  heterologous  fusion  protein  was  tested  for  stable  expression  in  COS  1  cells  and  nuclear 
localization  (data  not  shown).  In  general,  we  have  found  that  amino  acids  COOH-terminal  to  the 
RBCC  region  define  at  least  two  independent  repression  domains  within  KAP-1  that  appear  to  be 
additive  in  nature  in  order  to  obtain  maximal  levels  of  transcriptional  repression.  One  of  these 
domains  includes  the  recently  defined  HP1  binding  domain  of  KAP-1  (Lechner  et  al.,  2000; 

Ryan  et  al.,  1999).  Optimal  repression  of  transcription  was  dependent  upon  the  PHD  finger  and 
bromodomain.  When  autonomously  tethered  to  DNA,  these  two  motifs  demonstrated  a 
significant  level  of  repression,  suggesting  that  they  can  function  as  an  independent  repression 
domain  (Figure  1).  This  repression  activity  was  DNA-binding  dependent,  dose  dependent,  and 
universal  for  all  promoters  tested,  including  the  SV40  immediate  early  enhancer,  the  adenovirus 
major  late  promoter,  and  a  minimal  genomic  promoter  fragment  of  the  DNA  polymerase  (3  gene 
(data  not  shown).  Neither  the  PHD  finger  nor  the  bromodomain  could  independently  recapitulate 
this  repression  activity,  indicating  that  these  two  motifs  function  cooperatively  to  repress 
transcription  (Figure  1).  Furthermore,  expression  of  the  segment  encoding  the  PHD  finger  and 
bromodomain  of  KAP-1  relieved  both  GAL4-KRAB  and  GAL4-KAP-1  directed  repression, 
suggesting  that  repression  by  this  bipartite  domain  is  mediated  by  at  least  one  or  more  titratable 
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effector  molecules  (data  not  shown).  The  combination  of  these  data  suggests  that  the  PHD  finger 
and  bromodomain  of  KAP-1  function  as  an  integrated  unit  to  facilitate  repression  of  transcription 
by  KAP-1 .  Moreover,  these  data  implicate  an  additional  repression  mechanism  that  is 
independent  of  the  recruitment  of  HP1  proteins  (Lechner  et  al.,  2000;  Ryan  et  al.,  1999). 

Mutational  analysis  of  the  KAP-1  PHD  finger  and  bromodomain  in  repression 

In  order  to  correlate  the  molecular  relationship  between  amino  acids  in  the  PHD  finger 
and  bromodomain  and  transcriptional  repression,  we  have  used  a  site-directed  mutagenesis 
approach.  All  amino  acid  substitutions  were  made  in  the  context  of  the  heterologous  GAL4-PHD 
finger/bromodomain  fusion  protein.  Stable  expression  and  nuclear  localization  of  all 
heterologous  fusion  proteins  was  confirmed  in  COS1  cells  (Figure  2C  and  data  not  shown).  The 
effect  of  each  mutation  on  the  repression  properties  of  this  bipartite  domain  was  determined  in 
DNA-template  based  assays.  As  an  initial  strategy,  we  have  targeted  amino  acid  residues  in  the 
PHD  finger  of  KAP-1  (C628R,  V630S,  C651F,  P654C,  and  Q657E;  Figure2A)  which  spatially 
parallel  the  position  of  naturally  occurring  mutations  in  hATRX  that  confer  an  inherited 
susceptibility  to  developmental  defects  (Gibbons  et  al.,  1997).  As  illustrated  in  Figure  2C,  the 
C628R  and  C651F  mutations  significantly  impaired  the  repression  activity  of  this  domain,  while 
the  P654C  mutation  demonstrated  an  intermediate  effect.  Each  of  these  three  amino  acids  is 
highly  conserved  in  the  core  consensus  sequence  (Figure  2A)  and  may  be  involved  in  either 
metal  binding  or  maintenance  of  the  hydrophobic  core.  The  V630S  and  Q657E  mutations  had 
minimal  effects  on  repression,  and  therefore,  may  represent  residues  that  are  more  variable  to  the 
structure  and  function  of  the  KAP-1  PHD  finger. 
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Second,  we  targeted  amino  acid  residues  that  are  strictly  conserved  among  PHD  fingers 
most  closely  related  to  KAP-1.  These  data  indicate  that  mutation  of  any  conserved  cysteine 
residue  (CC  628,  631  AA,  C643A,  C666A;  Figure  2A)  nearly  abolished  all  repression  by  this 
domain  (Figure  2C).  Mutation  of  the  conserved  Trp664  residue  yielded  a  similar  result  as  the 
cysteine  mutations,  suggesting  that  it  may  have  a  role  in  maintaining  the  structure  of  this  domain. 
Although  the  four  remaining  amino  acids  mutated  (G635A,  D636A,  H648C,  and  P660A)  are 
highly  conserved  among  many  members  of  the  PHD  finger  family  (Figure  2A),  mutation  of  each 
demonstrated  marginal  effects  on  the  repression  activity  of  this  bipartite  domain  (Figure  2C). 

The  final  scheme  focused  on  the  PHD  finger  involved  mutating  amino  acids  in  the 
variable  regions  of  the  KAP- 1  PHD  finger  to  residues  present  in  the  PHD  fingers  of 
TIFla/TIFly  (Figure  2A).  Only  the  M639C  mutation  demonstrated  an  effect  on  repression  that 
was  comparable  to  the  P654C  mutation  (Figure  2C).  The  three  remaining  mutations  (C646V, 
F647Y,  and  SL  667,668  TF)  demonstrated  a  minimal  effect  on  the  repression  activity  of  this 
domain,  suggesting  that  KAP-1/TIF1  family  may  utilize  the  PHD  finger  and  bromodomain  to 
repress  transcription  through  a  common  biochemical  mechanism. 

We  also  used  the  information  provided  by  the  structure  of  the  pCAF  bromodomain  to 
design  six  independent  mutations  in  each  of  the  four  a-helices  and  one  mutation  in  the  variable 
loop  region  between  helix  Z  and  helix  A  of  the  KAP-1  bromodomain  (Figure  2B).  As  illustrated 
in  Figure  2C,  mutations  in  helix  B  (F761A)  and  helix  C  (DD  778,781AA,  trunc  781,  and  F791E) 
significantly  impaired  transcriptional  repression  (Figure  2C).  The  S730I  mutation  in  the 
variable  loop  region  between  helix  Z  and  helix  A  also  displayed  an  intermediate  effect  on  the 
repression  potential  of  this  domain,  reaffirming  the  potential  role  for  this  region  in 
macromolecular  recognition.  Overall,  the  data  from  this  mutational  analysis  is  consistent  with 


Schultz  et  al.-  9 


the  PHD  finger  and  bromodomain  of  KAP- 1  functioning  cooperatively,  as  independent 
mutations  in  either  domain  can  significantly  disrupt  the  intrinsic  repression  potential  of  this 
bipartite  domain.  One  interpretation  of  this  data  is  that  these  two  motifs  provide  an  interface  for 
protein-protein  interactions  with  downstream  effectors  of  transcriptional  repression. 

Comparative  analysis  of  the  KAP-1  PHD  finger  and  bromodomain  with  domains  from 
related  proteins 

Mutational  analysis  of  the  KAP-1  PHD  finger  in  which  mutations  designed  to  make  it 
more  similar  to  its  orthologues,  TIFla  and  TIFly,  suggested  that  these  domains  may  function  via 
a  common  biochemical  mechanism.  To  directly  test  this  hypothesis,  we  have  constructed 
heterologous  fusions  between  the  GAL4  DNA  binding  domain  and  the  PHD  finger  and 
bromodomains  of  TIFla,  TIFly,  and  WCRF180  (Bochar  et  al.,  2000)  (Figure  3A).  In  addition, 
we  engineered  chimeric  proteins  in  which  either  the  PHD  finger  or  bromodomain  of  TIFla, 
TIFly,  hATRX,  Mi-2a,  WCRF180,  or  hGCN5  were  substituted  for  the  KAP-1  sequences  in 
order  to  address  the  specificity  of  the  repression  activity  displayed  by  this  bipartite  domain 
(Figure  3A).  Stable  expression  and  nuclear  localization  of  each  chimeric  fusion  protein  was 
confirmed  in  COS1  cells  (Figure  3B  and  data  not  shown).  When  tethered  to  DNA,  the  PHD 
finger  and  bromodomain  of  TIFla  demonstrated  a  potential  to  activate  transcription  of  the 
reporter  template,  while  the  PHD  finger  and  bromodomain  of  TIFly  and  WCRF180  each  failed 
to  demonstrate  any  appreciable  potentiation  of  transcription  (Figure  3A).  Heterologous 
substitution  of  the  KAP-1  PHD  finger  or  bromodomain  with  similar  sequences  from  other 
transcriptional  regulators  failed  to  completely  complement  the  repression  activity  of  the  wild- 
type  KAP-1  PHD  finger  and  bromodomain  (Figure  3A).  Interestingly,  when  the  bromodomain 
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of  KAP-1  was  tethered  with  the  PHD  finger  of  TIFla,  the  chimeric  protein  functioned  to  repress 
transcription  from  the  reporter  template.  In  general,  the  KAP-1  bromodomain,  when  fused  to 
any  PHD  finger,  slightly  enhanced  the  repression  potential  of  the  chimeric  protein,  suggesting 
that  the  bromodomain  of  KAP-1  is  essential  in  mediating  specific  interactions  with  effectors 
which  promote  repression  of  transcription  (Figure  3B).  The  combination  of  these  data  provides 
additional  support  that  the  PHD  finger  and  bromodomain  of  KAP- 1  function  as  an  integrated 
functional  unit  to  specifically  repress  transcription.  The  data  also  imply  that  not  all  PHD  fingers 
and  bromodomains  are  functionally  equivalent,  despite  demonstrating  homology  in  the  primary 
amino  acid  sequence.  Furthermore,  the  spatial  conservation  of  these  two  motifs  architecturally 
in  proteins  is  not  sufficient  to  infer  a  common  function  for  this  bipartite  domain. 

Identification  of  downstream  effectors  of  KAP-1  PHD/bromodomain-mediated 
transcriptional  repression 

To  identify  potential  mechanisms  of  transcriptional  repression  directed  by  the  KAP-1 
PHD  finger  and  bromodomain,  a  heterologous  fusion  between  the  LEXA  DNA  binding  domain 
and  the  PHD  finger  and  bromodomain  of  KAP-1  was  used  in  a  yeast  two-hybrid  screen.  To 
confirm  the  specificity  of  any  potential  protein  interactions  identified  in  this  screen,  we  utilized 
several  of  the  amino  acid  substitutions  engineered  for  the  evaluation  of  the  structure/function 
analysis  of  this  bipartite  domain.  Appropriate  expression  of  the  heterologous  fusion  proteins  was 
confirmed  by  Western  blot  analysis  and  bait  proteins  displayed  no  ability  to  self-activate  the 
selection  promoters  (data  not  shown). 

From  40  million  library  transformants  four  different  gene  products  were  recovered  based 
on  coactivation  of  both  the  integrated  LEXA  responsive  His3  and  LacZ  reporter  genes.  The 
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nucleotide  sequence  of  three,  independent  transformants  of  one  gene  product  indicated  that  each 
clone  possessed  an  identical  fusion  junction  to  the  GAL4  activation  domain  and  that  the 
sequence  was  identical  to  the  3’  nucleotide  sequence  of  the  dermatomyositis-specific 
autoantigen,  Mi-2a/CHD3.  Upon  reintroduction  into  yeast,  this  GAL4  activation  domain  fusion 
protein  displayed  robust  ability  to  activate  the  LacZ  reporter  with  the  wild-type  LEXA-KAP-1 
fusion  protein  (Figure  4A),  but  failed  to  activate  the  reporter  with  other  irrelevant  test  baits  (data 
not  shown).  Two  PHD  finger  mutants  (V630S  and  G635A)  that  demonstrated  nearly  wild-type 
levels  of  transcriptional  repression  activity  also  displayed  the  ability  to  activate  the  LacZ  reporter 
(Figure  4A).  In  contrast,  PHD  finger  mutants  CC628,631AA  ,  C651F,  and  W664A,  which 
possess  impaired  repression  activity,  either  failed  to  activate  these  reporters  or  activated  with 
reduced  efficiency,  respectively  (Figure  4A).  This  putative  protein-protein  interaction  was  also 
observed  to  be  dependent  on  an  intact  bromodomain,  as  the  781  truncation  mutation  displayed 
reduced  affinity  for  the  Mi-2a  in  the  two-hybrid  assay  (Figure  4A).  These  observations  are 
completely  consistent  with  the  transcriptional  effects  observed  in  transient  transfection  assays 
(Figure  2C)  in  which  the  repression  activity  was  dependent  upon  both  domains.  Furthermore, 
mutations  in  either  the  PHD  finger  or  the  bromodomain  were  independently  sufficient  to  ablate 
the  repression  function  of  this  bipartite  domain.  The  combination  of  these  data  strengthens  the 
argument  that  the  PHD  finger  and  bromodomain  of  KAP-1  function  as  an  integrated  functional 
unit,  which  provides  an  interface  for  protein-protein  interactions  with  molecules  that  facilitate 
repression  by  the  KRAB: KAP-1  complex. 

Mi-2a/CHD3  is  a  member  of  the  CHD  family  of  proteins  which  were  named  based  on 
the  possession  of  several  signature  motifs.  Chromodomains,  ATPase/Helicases,  and  DNA 
binding  modules  (Delmas  et  al.,  1993;  Woodage  et  al.,  1997).  Mi-2a/CHD3  is  closely  related  to 
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the  Mi-2|3/CHD4  protein.  Comparison  of  the  amino  acids  in  the  putative  KAP-1  interaction 
domain  of  Mi-2a/CHD3  to  Mi-2|3/CHD4  revealed  that  these  two  proteins  are  80%  identical,  yet 
all  three  KAP-1  interacting  clones  rescued  in  this  screen  possessed  sequence  specific  to  Mi- 
2a/CHD3.  Based  on  the  sequences  of  these  clones,  amino  acids  1686  to  2000  harbor  the  minimal 
KAP-1  interaction  domain  (KID)  (Figure  4B).  This  sequence  encodes  a  protein  with  an 
additional  30  amino  acids  at  the  COOH-terminus  of  the  protein  that  are  not  present  in  the 
putative  protein  described  by  Woodage  et  al.  for  CHD3  (accession  number  AF006515),  yet  the 
remainder  of  the  two  sequences  were  identical.  To  test  the  functional  significance  of  the  novel 
30  amino  acids  contained  in  our  rescued  clones  versus  the  COOH-terminal  sequence  reported  for 
CHD3,  and  the  ability  of  Mi-2(3/CHD4  to  interact  with  KAP-1,  we  engineered  fusions  of  these 
COOH-terminal  sequences  with  the  GAL4  activation  domain,  respectively  (Figure  4B).  Upon 
introduction  into  yeast,  the  wild-type  LEXA-PHD/bromodomain  fusion  of  KAP-1  displayed  no 
activation  of  the  LacZ  reporter  for  either  protein,  indicating  that  the  COOH-termini  of  these  two 
proteins,  CHD3  and  Mi-2p/CHD4,  are  not  functionally  equivalent  to  the  Mi-2a  sequence  we 
rescued  in  the  two-hybrid  screen  (Figure  4B).  Closer  inspection  of  the  COOH-terminal  amino 
acid  residues  revealed  that  Mi-2a/CHD3  quickly  diverges  from  Mi-2[3/CHD4  starting  at  amino 
acid  1909  and  continues  for  an  additional  91  novel  amino  acids.  It  is  these  amino  acid  residues 
that  most  likely  confer  the  specificity  of  the  Mi-2a/CHD3  interaction  with  KAP-1. 

In  order  to  evaluate  the  role  of  Mi-2a/CHD3  in  KAP-1  mediated  transcriptional 
repression,  we  chose  to  disrupt  the  endogenous  KAP-l/Mi-2a  interaction  by  expressing  a 
dominant  negative  protein  corresponding  to  amino  acids  1738  to  2000  of  Mi-2a  (Figure  5A). 
Expression  of  these  amino  acids  dominantly  inhibited  KRAB-PAX3  mediated  repression  of  a 
DNA  template  in  a  dose  dependent  manner  (Figure  5A).  The  same  effect  was  observed  for 
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PAX3-KAP-1  directed  repression  (data  not  shown).  Expression  of  the  dominant  negative  protein 
had  little  effect  on  the  basal  activity  of  the  minimal  TK  promoter,  the  activity  obtained  in 
combination  with  other  non-KRAB  PAX3  repression  domain  fusions,  or  a  PAX3  fusion 
comprised  of  the  FKHR  activation  domain  (Figure  5A,  data  not  shown).  Consistent  with  the 
yeast  two-hybrid  data,  truncation  of  the  COOH-terminal  30  amino  acids  of  the  Mi-2a  KID 
impaired  this  inhibitory  activity  (data  not  shown).  Since  Mi-2a  has  been  described  as  integral 
component  of  a  high  molecular  weight  multiprotein  complex  containing  histone  deacetylase 
activity,  we  tested  whether  the  addition  of  trichostatin  A  (TSA)  could  reverse  the  repression 
activity  of  the  KAP-1  PHD  finger  and  bromodomain.  Figure  5B  illustrates  that  addition  of  300 
nM  TSA  partially  reversed  KAP-1  PHD/bromodomain-mediated  repression  of  three 
independent  promoters.  These  data  provide  evidence  that  Mi-2a  and  histone  deacetylases  may 
be  one  potential  downstream  effector  of  the  KRAB-ZFP:  KAP-1  repressoncorepressor  complex 
in  vivo. 

Discussion 

The  recurrent  theme  that  has  emerged  from  studies  aimed  at  understanding  the  functional 
relationships  of  protein-protein  interactions  is  that  highly  conserved  amino  acid  signature  motifs 
often  function  as  independent  globular  domains  involved  in  macromolecular  recognition.  The 
identification  of  theses  signature  amino  acid  sequence  motifs  and  their  spatial  organization 
within  a  novel  protein  is  often  the  first  clue  to  its  biochemical  function.  We  and  others  have 
previously  identified  a  universal  corepressor,  KAP-1,  which  binds  to  the  KRAB  repression 
module  (Friedman  et  al.,  1996;  Kim  et  al.,  1996;  Moosmann  et  al.,  1996).  Structurally,  KAP-1 
possesses  consensus  amino  acid  sequences  for  a  RING  finger,  B-boxes,  leucine  zipper  coiled- 
coil  region,  PHD  finger,  and  bromodomain  (Friedman  et  al.,  1996;  Kim  et  al.,  1996;  Le  Douarin 
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et  al.,  1996;  Moosmann  et  al.,  1996).  Biochemical  studies  of  KAP-1  indicate  that  the  tripartite 
RBCC  region  functions  as  an  integrated  structural  unit  that  is  necessary  and  sufficient  for  KRAB 
recognition  and  KAP-1  oligomerization  (Peng  et  al.,  2000).  Mechanistically,  a  direct  interaction 
with  members  of  the  HP1  protein  family  suggests  that  KAP-1  may  function  to  repress 
transcription  via  the  organization  of  higher  order  chromatin  structure  (Le  Douarin  et  al.,  1996; 
Lechner  et  al.,  2000;  Nielsen  et  al.,  1999;  Ryan  et  al.,  1999).  In  this  report,  we  present  data  that 
builds  upon  our  understanding  of  KAP-l’s  function  in  repression  of  transcription.  We  describe  a 
novel  transcriptional  repression  activity  defined  by  the  PHD  finger  and  bromodomain  of  KAP- 1 . 
These  domains  are  collectively  required  for  optimal  repression  of  DNA  templates  by  KAP-1,  and 
our  studies  suggest  that  these  two  motifs  form  a  functionally  integrated  unit,  as  mutations  or 
heterologous  substitutions  in  either  domain  disrupt  transcriptional  repression.  These  data  are 
consistent  with  this  bipartite  domain  providing  a  specific  interface  for  protein-protein 
interactions.  In  this  regard,  we  show  evidence  that  the  PHD  finger  and  bromodomain  of  KAP- 1 
may  function  to  target  the  histone  deacetylase  complex,  NuRD,  to  promoter  sequences  regulated 
by  KRAB-zinc  finger  proteins. 

The  PHD  finger  and  bromodomain  are  two  well  conserved  signature  motifs  widely 
distributed  among  nuclear  proteins  with  established  roles  in  the  formation  and  maintenance  of 
chromatin  structure  (Aasland  et  al.,  1995;  Jeanmougin  et  al.,  1997;  Winston  and  Allis,  1999). 
Thus,  the  presence  of  these  two  motifs  suggested  that  KAP-1  represses  transcription  via  some 
chromatin  mediated  mechanism.  Furthermore,  the  conservation  of  the  PHD  finger  and 
bromodomain  architecturally  in  at  least  three  independent  families  of  proteins,  implies  that  the 
two  domains  biochemically  function  together  (Bochar  et  al.,  2000;  Friedman  et  al.,  1996;  Jones 
et  al.,  2000;  Jones  et  al.,  2000;  Le  Douarin  et  al.,  1996;  Le  Douarin  et  al.,  1995;  Venturini  et  al., 
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1999).  In  support  of  this  hypothesis,  optimal  transcriptional  repression  by  KAP-1  is  dependent 
upon  the  integrity  of  this  bipartite  domain.  Comparative  analysis  of  the  PHD  finger  and 
bromodomain  of  TIFla,  TIFly  and  WCRF180  suggests  that  the  function  of  this  bipartite  domain 
is  not  equivalent  among  all  proteins  that  share  this  particular  spatial  arrangement  of  motifs. 
Heterologous  substitution  of  either  of  these  domains  in  KAP-i  with  related  sequences  essentially 
failed  to  complement  the  intrinsic  transcriptional  repression  activity  of  the  natural  KAP- 1 
sequence.  Thus,  it  appears  from  these  results  that  the  amino  acid  sequence  of  the  KAP-1  PHD 
finger  and  bromodomain  possess  an  inherent  capacity  to  fold  into  an  integrated  functional  unit, 
which  may  provide  a  specific  interface  for  protein-protein  interactions  with  downstream 
effectors  of  transcriptional  repression. 

The  PHD  finger  is  a  small  domain  characteristically  defined  by  seven  cysteines  and  a 
histidine  that  are  spatially  arranged  in  a  consensus  of  C4HC3  of  varying  lengths  and  composition 
(Aasland  et  al.,  1995).  This  particular  arrangement  of  amino  acids  is  highly  homologous  to  the 
RING  finger  motif,  an  interface  for  protein-protein  interactions  which  is  created  by  chelating  two 
zinc  ions  in  a  unique  cross-braced  fashion  (Borden,  2000;  Borden,  1998;  Jensen  et  al.,  1998;  Wu 
et  al.,  1996).  The  biological  importance  of  the  PHD  finger  to  protein  function  is  further 
underscored  by  the  identification  of  naturally  occurring  germline  mutations  which  encode  for 
amino  acid  substitutions  in  the  PHD  finger  of  hATRX  that  predispose  individuals  to 
developmental  defects  (Gibbons  et  al.,  1997).  Our  studies  represent  the  first  detailed  site- 
directed  mutagenesis  approach  to  correlate  the  molecular  anatomy  of  this  domain  with  a 
biochemical  function.  In  general,  mutation  of  any  highly  conserved  cysteine  residues  of  the  core 
signature  sequence  was  observed  to  dramatically  effect  the  repression  activity  of  KAP-1.  This 
observation  was  not  unexpected,  as  mutations  of  any  of  the  analogous  amino  acids  in  the  RING 
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finger  disrupt  zinc  chelation  and  protein  structure.  Mutation  of  the  histidine  at  the  putative  fifth 
metal  coordination  position  to  cysteine  had  marginal  effects  on  the  repression  activity,  but  it  is 
possible  that  the  introduced  cysteine  residue  is  capable  of  substituting  in  the  proper  coordination 
of  zinc.  In  addition  to  these  core  amino  acids  which  comprise  the  signature  sequence,  mutation 
of  W664  significantly  impaired  the  repression  activity,  while  the  M639C  and  P654C  mutations 
demonstrated  a  moderate  effect  on  repression.  The  introduction  of  an  extra  cysteine  residue  by 
the  M639C  and  P654C  mutations  may  have  altered  proper  metal  chelation  properties  of  this 
domain.  The  W664  amino  acid  is  highly  conserved  spatially  in  the  related  RING  finger  motif, 
and  mutation  of  the  analogous  amino  acid  in  the  c-Cbl  proto-oncogene  yielded  a  loss  of  function 
(Joazeiro  et  al.,  1999).  Preliminary  structural  studies  of  the  PHD  finger  indicate  that  this  residue 
is  likely  to  be  involved  in  maintenance  of  the  hydrophobic  core  of  this  domain's  fold 
(unpublished  data).  Overall,  the  functional  significance  of  any  of  these  mutations  will  have  to 
wait  until  a  structure  for  the  PHD  finger  is  solved,  and/or  new  biochemical  functions  for  this 
domain  are  described.  Nonetheless,  this  panel  of  mutations  will  serve  as  a  useful  tool  in  future 
studies  in  defining  new  macromolecular  interactions  and  functions  for  the  PHD  finger. 

Similar  to  the  PHD  finger,  the  bromodomain  is  a  highly  conserved  structural  motif  with  a 
putative  role  in  modulating  the  activities  of  multiprotein  transcriptional  regulatory  complexes 
(Bariev  et  al.,  1998;  Cairns  et  al.,  1999;  Dhalluin  et  al.,  1999;  Ornaghi  et  al.,  1999;  Sterner  et  al., 
1999;  Syntichaki  et  al.,  2000).  Furthermore,  the  function  of  the  bromodomain  in  these  proteins 
may  require  or  be  influenced  by  adjacent  signature  motifs  (Cairns  et  al.,  1999;  Winston  and 
Allis,  1999).  Our  data  from  the  analysis  of  the  KAP-1  bromodomain  are  consistent  with  these 
hypotheses.  First,  it  appears  that  the  bromodomain  of  KAP-1  has  intrinsic  ability  to  potentiate 
repression  of  transcription.  Second,  optimal  repression  of  transcription  by  KAP-1  is  dependent 
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upon  the  integrity  of  the  bipartite  PHD  finger  and  bromodomain,  as  mutations  in  either  domain 
are  sufficient  to  significantly  impair  KAP-1  directed  repression  of  reporter  templates.  Third,  the 
mutations  in  the  bromodomain  that  disrupt  the  repression  activity  are  consistent  with  this 
domain’s  involvement  in  macromolecular  recognition.  Finally,  significant  evidence  suggests 
that  some  feature  of  histones  or  nucleosomal  structure  serve  as  a  ligand  for  the  bromodomain. 
The  specific  interaction  between  the  PHD  finger  and  bromodomain  of  KAP-1  with  the  Mi-2a 
subunit  of  the  NuRD  complex  may  require  some  interface  provided  by  the  potential  interaction 
between  the  bromodomain  of  KAP-1  and  nucleosomes.  This  hypothesis  would  be  consistent  with 
recent  genetic  evidence  that  indicates  the  bromodomain  of  GCN5  is  required  for  the  chromatin 
remodeling  activities  of  the  SWI/SNF  complex  at  a  genomic  locus  in  yeast  (Syntichaki  et  al., 
2000).  In  this  regard,  both  the  His3  and  the  LacZ  reporters  used  in  the  selection  process  for 
positive  interacting  clones  in  the  two-hybrid  assay  were  chromosomally  integrated. 

Furthermore,  this  potential  ternary  complex  may  explain  why  no  polypeptides  with  an  obvious 
targeting  mechanism  were  identified  in  the  original  biochemical  purifications  of  NuRD  (Tong  et 
al.,  1998;  Xue  et  al.,  1998;  Zhang  et  al.,  1998).  Both  Mi- 2a  and  Mi-2p  have  been  describe  as 
integral  components  of  a  multi-protein  complex  containing  both  histone  deacetylase  and 
chromatin  remodeling  activities  (Tong  et  al.,  1998;  Wade  et  al.,  1998;  Xue  et  al.,  1998;  Zhang  et 
al.,  1998).  The  association  of  KAP-1  with  Mi-2a/CHD3  builds  upon  our  understanding  of  how 
the  KRAB-ZFP: KAP-1  repressor:corepressor  system  mechanistically  regulates  gene  expression. 
Moreover,  the  interaction  between  KAP-1  and  Mi-2a/CHD3  may  be  one  alternative  mechanism 
by  which  the  biochemical  activities  of  the  NuRD  complexes  are  targeted  to  specific  genes  in 
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Several  pieces  of  evidence  have  implied  a  role  for  the  acetylation  status  of  histone  tails 
with  mechanisms  of  long  term  silencing  of  gene  expression.  Genetic  analyses  in  D. 
melanogaster  have  demonstrated  a  potential  role  for  dMi-2  in  polycomb  silencing  of  homeotic 
genes  (Kehle  et  al.,  1998).  Consistent  with  this  genetic  observation,  the  sequence  specific 
transcription  factor  Ikaros  was  observed  to  colocalize  with  components  of  the  NuRD  complex 
and  the  heterochomatin  protein  1(M31),  during  lymphoid  cell  development  (Kim  et  al.,  1999). 

In  this  regard,  we  and  others  have  reported  a  direct  biochemical  interaction  between  KAP-1  and 
mammalian  homologues  of  the  HP1  protein  family,  and  have  shown  that  KAP-1  and  mammalian 
HP1  proteins  can  physically  occupy  the  same  spatial  domains  in  interphase  nuclei  of  NIH/3T3 
cells  (Ryan  et  al.,  1999).  Extensive  genetic  analyses  of  both  the  polycomb  and  HP1  protein 
families  have  firmly  established  a  role  for  these  proteins  in  epigenetic  gene  silencing  in  D. 
melanogaster.  Furthermore,  genetic  screens  for  dominant  suppressors  of  position-effect- 
variegation  in  D.  melanogaster  have  identified  mutant  alleles  of  HDAC1  (Mottus  et  al.,  2000). 
The  combination  of  these  data  implies  a  potential  link  between  multi-protein  complexes  that 
promote  transcriptional  repression  by  histone  deacetylation  and  repression  mediated  by 
organization  of  high-order  chromatin  structure.  In  support  of  this  hypothesis,  SAP30,  a  novel 
member  of  the  SIN3  histone  deacetylase  complex,  has  been  observed  to  effect  telomere  silencing 
in  yeast  (Zhang  et  al.,  1998).  This  apparent  coupling  of  two  independent  repression  mechanisms 
may  lead  to  synergistic  repression  of  their  apparent  target  genes. 

The  relative  abundance  of  the  KRAB  domain  may  potentially  make  it  one  of  the  single 
most  common  mechanisms  of  transcriptional  repression  directed  by  sequence-specific  factors  in 
vertebrates.  Transcriptional  repression  by  a  KRAB  domain  requires  interaction  with  the 
corepressor,  KAP-1.  Thus,  it  appears  that  the  effector  molecules  of  KRAB-mediated 
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transcriptional  repression  are  likely  due  to  a  network  of  protein  interactions  with  KAP-1.  We 
suggest  that  the  KAP- 1  corepressor  may  function  to  repress  transcription  through  several  distinct 
mechanisms,  including  histone  deacetylation  and  heterochromatinization.  Unlike  prokaryotic 
genes,  eukaryotic  genes  exist  in  a  “restrictive”  state  by  virtue  of  nucleosome  based  chromatin 
structure  (Struhl,  1999).  In  this  model,  the  role  of  sequence  specific  repressors  like  the 
superfamily  of  KRAB-ZFP’s  may  be  to  create  a  silent  gene  environment  completely  resistant  to 
activator  and/or  core  transcription  machinery  binding.  It  may  be  that  corepressor  molecules,  like 
KAP-1,  exist  to  assist  in  recruiting  independent  biochemical  activities  that  function 
combinatorially  to  optimally  repress  a  target  locus  in  vivo.  The  abundance  of  KRAB  domain 
proteins  and  the  capacity  of  the  KAP-1  corepressor  to  repress  transcription  through  multiple 
independent  mechanisms  increase  the  potential  of  the  KRAB: KAP-1  repressorxorepressor 
system  to  serve  as  a  universal  regulator  of  transcription. 

Materials  and  Methods 

The  pM2-KAP-l  (293-835)  mammalian  expression  vector  has  been  previously  described 
(Friedman  et  al.,  1996).  The  GAL4-KAP-1  fusion  proteins  including  408-835,  408-618,  478-835, 
478-680,  478-618  were  created  by  PCR  using  the  primers  408for  (BamHl)  5’- 
CCGGGATCCAGATTGTGGCAGAGCGTCCTG-3’,  478for(fiamHI)  5’- 
CCGGGATCCAGGTGAGCGGCCTTATGCGC-3’,  618rev(tfmr/III)  5’- 
G  AT  A  AGCTTTC  ACGGGCC  ACC  ACCTGGGGC-3  ’ ,  835rev(XbaI)  5’- 

GCTATCT  AG  ACTA  A  ATGGTGGC  ACTGTC  ATCC  AGG-3  ’ ,  respectively.  The  resulting  PCR 
products  were  digested  with  the  restriction  endonucleases  indicated  in  brackets  preceding  the 
primer  sequence  and  cloned  into  the  corresponding  restriction  sites  of  the  vector  pMl.  The 
GAL4-KAP- 1  (619-835)  expression  plasmid  was  constructed  by  subcloning  a  Xmal  fragment 


Schultz  et  al.-  20 


from  pBL-KAP-1  into  the  Xmal  site  of  pMl.  The  GAL4-KAP-1  (619-679)  expression  plasmid 
was  created  by  PCR  using  the  primers,  619for(£,coRI)  5’- 
TAGCGAATTCGGA ACCCTGGATGACAGTGC-3 ' 

and  679rev(Sfl/I)  5  ’  -  AT CGGTGG AC ATCCTCCTCCTTC AGGTC AG-3  ’ .  The  resulting  PCR 
product  was  digested  with  the  restriction  endonucleases  indicated  in  brackets  and  cloned  into  the 
corresponding  sites  of  the  vector  pMl.  Carboxy-terminal  to  codon  679  of  KAP-1  the  protein 
reads  VDASAEASR*.  The  GAL4-KAP-1  (674-835)  expression  plasmid  was  created  by  PCR, 
using  the  primers,  674for(toHI)  5’-ATCGGATCCGACCTGAAGGAGGAGGATGGC-3’  and 
835rev(//iWIII)  5 ’  -G ATCCCGGG A AGCTTTC AGGGGCC ATC ACCTGG-3 ’ .  The  resulting 
PCR  product  was  digested  with  the  restriction  endonucleases  indicated  in  brackets  and  cloned 
into  the  corresponding  sites  of  the  vector  pM2. 

Site-directed  point  mutations  in  the  KAP-1  PHD  finger  and  bromodomain  were 
engineered  by  standard  overlap  extension  PCR-mediated  mutagenesis  procedures.  The 
mutagenic  primers  for  the  described  mutations  contained  the  following  codons:  CC628,631AA, 
TGC  to  GCC;  C628R,  TGC  to  CGC;  V630S,  GTC  to  AGC;  G635A,  GGC  to  GCC;  D636A, 

GAT  to  GCT;  M639C,  ATG  to  TGC;  C643A,  TGT  to  GCT;  C646V,  TGT  to  GTT;  F647Y,  TTC 
to  TAC;  H648C,  CAC  to  TGC;  C651F,  TGT  to  TTT;  P654C,  CCG  to  TGC;  Q657E,  CAG  to 
GAG;  P660A,  CCA  to  GCA;  W664A,  TGG  to  GCG;  C666A,  TGC  to  GCC;  SL667,668TF, 

TCA  and  CTC  to  ACA  and  TTC;  E715K,  GAA  to  AAA;  S730I,  TCC  to  ATC;  L742T,  CTG  to 
ACG;  F761A,  TTT  to  GCT;  D778,781AA,  GAC  to  GCT;  F791E,  TTC  to  GAG.  The  pMl-KAP- 
1  (619-835)  trunc781  resulted  from  spontaneous  deletion  of  G2444  from  the  nucleotide  sequence 
during  PCR  of  the  D778,781AA  mutation  and  a  reading  frame  shift  that  results  in  a  protein  with 
15  novel  amino  acids,  AKAACSPSSGCSASSRRA*  starting  at  codon  778  of  KAP-1. 
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The  GAL4  expression  plasmids  for  the  PHD  finger  and  bromodomain  of  TIFla  (amino 
acids  785-1016),  TIFly  (amino  acids  881-1 120),  and  WCRF180  (amino  acids  1 141-1556)  were 
created  by  PCR  using  the  primers  TIFlafor(5/;;aI)  5’- 
GATCCCGGGAGGAAGGAGGATGACCCC-3’,  TIFlarev(5/;/«I)  5’- 
GT ACCCGGGTT ACTT AAGC AGCTGGCG-3 ’ ,  TIFlyfor(5moI)  5’- 
GATCCCGGGAATAAAGATGATGACCC-3’,  TIFlyrev(Swal)  5’- 
CTGCCCGGGTC ATCTG ACTTT AGGCGT -3 ’ ,  WCRF180for(SmaI)  5’- 
G  ATCCCGGG  AT  ATGGTCT  A  A  ATCT  AT  A-3  ’ ,  and  WCRF180rev(5maI)  5’- 
CTGCCCGGGTCAGATTCGTGACTTTTTTGC-3’,  respectively.  The  resulting  PCR  products 
were  digested  with  the  restriction  endonucleases  indicated  in  brackets  and  cloned  into  the 
corresponding  sites  of  either  pMl  or  pM3.  The  PHD  finger  and  bromodomain  chimeras  were 
engineered  by  standard  overlap  extension  PCR-mediated  mutagenesis  procedures.  Forward 
primers  were:  ATRX  5’(SmaI)  5’ -GACCCGGGT AGCCGTGACTCAGATGG-3 ’ ,  Mi-2a 
5’(Smal)  5 ’ -G ATCCCGGGG ATGGCT ACG AGACGG ATC-3 ’ ,  and  as  indicated  above  for 
WCRF180  and  KAP-1.  Fusion  primers  were  as  follows:  TIFlcc/KAP-lfor  5’- 
AAGCCAGAGGTTGACTATGGCAGCCTCAGCCTGGAT-3’,  TIFly/KAP-lfor  5’- 
AAGCCAGAAGTTGAATATGGCAGCCTCAGCCTGGAT-3’,  ATRX/KAP-lfor  5’- 
ACTGCATGTAACAGCGTAGGCAGCCTCAGCCTCAGC-3’,  Mi-2ot/KAP-lfor  5’- 
GTCCAGTGGGAGGCC AAGGGCAGCCTCAGCCTCAGC-3 ’ ,  WCRF180/KAP-lfor  5’- 
CGTTCTAGAAGACTCTCCGGCAGCCTCAGCCTCAGC-3’,  KAP-1 /TIFlafor  5’- 
GACCTGAAGGAGGAGGATGATTGTGATGTTCCCAGT-3’,  KAP-1 /TIFlyfor  5’- 
GACCTGAAGGAGGAGGATGATTGTGATAATTTGCAA-3’,  KAP-1  AVCRF180for  5’- 
G  ACCTGA  AGGAGG  AGG  AT  AG  ATCTGT  A  AAT  ATTGCT-3  ’ ,  KAP-l/GCN5for  5’- 
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GACCTGAAGGAGGAGGATACAGGCTGGAAGCCATTG-3’.  Reverse  primers  were  as 
follows:  GCN5rev(S/nflI)  5’-GATCCCGGGCTACTTGTCAATGAGGCC-3\  and  as  indicated 
above  for  KAP-1,  TIFlot,  TIFly,  and  YVCRF180.  Second-round  PCR  products  were  digested 
with  restriction  endonucleases  as  defined  in  the  parenthesis  preceding  the  primer  sequence  and 
cloned  into  the  corresponding  sites  of  either  pMl  or  pM3. 

The  LEXA-KAP-1  (amino  acids  619-835)  yeast  expression  plasmid  was  generated  by  the 
ligation  of  a  Xmal  fragment  from  pBL- KAP-1  into  pBTMl  16.  PHD  finger  and  bromodomain 
mutants  were  subcloned  from  pMl  to  pBTMl  16  as  a  Smal  fragment.  GAL4  activation  domain 
fusions  with  the  C-terminus  of  Mi-2adel  and  Mi-2(3  were  created  by  PCR  using  the  following 
primers,  respectively:  Mi-2afor(fia/nHI)  5’- 

GTAACGGATCCAGGGCCTCGAGATGAGCCAC-3’,  Mi-2(X  delta  CTrev(fimnHI)  5’- 
G  ATCGG  ATCCT  C  ACGTTGGT  GGCGGCT  GT  G  ATG  A  AG-3  ’ ,  Mi-2pfor(BamHI)  5’- 
GTAACGGATCCAAATGGAGAGACCCCCAAGGACCTG-3’,  Mi-2(3  rev(BamUl)  5’- 
G ATCGG ATCCTCACTGCTGCTGGGCT ACCTGCTG-3 ’ .  Resulting  PCR  products  were 
digested  with  BamUl  and  cloned  into  the  corresponding  restriction  site  of  pACTII  (Clontech). 

The  CMV-Mi-2a  ct  mammalian  expression  plasmid  was  created  by  subcloning  a 
BgllUBamUl  fragment  from  the  rescued  cDNA  in-frame  with  an  N-terminal  6His-tag  possessing 
a  consensus  Kozak  sequence  and  initiator  methionine.  The  KRAB-PAX3  and  the  PAX3-KAP-1 
expression  constructs  have  been  described  previously  (Fredericks  et  al.,  2000).  Appropriate 
reading  frame  fusions  and  integrity  of  flanking  sequences  for  all  constructs  created  by  PCR  was 
confirmed  by  DNA  sequence  analysis  of  both  strands. 


Schultz  et  al.-  23 


Yeast  two-hybrid :  The  yeast  two-hybrid  system  as  modified  by  Stan  Hollenberg  was  used  for  all 
yeast  experiments.  A  human  oligo-dT-primed  B-cell  cDNA  library  was  screened  as  previously 
described  (Jensen  et  al.,  1998). 

Transient  transfection:  Protein  expression  from  all  plasmids  was  confirmed  by  transient  transfection 
of  COS-1  cells  followed  by  immunoprecipitation  of  ,5S-methionine  labeled  cell  extracts,  as 
described  previously.  All  transcription  assay  transfections  were  done  as  previously  described  (Ryan 
et  al.,  1999). 
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Figure  Legends 

Figure  1.  Analysis  of  the  intrinsic  repression  activity  of  KAP-1  indicates  that  the  PHD  finger 
and  bromodomain  are  required  and  sufficient  to  repress  transcription.  (A)  Schematic  illustration 
of  the  heterologous  GAL4-KAP-1  constructs,  as  defined  by  the  amino  acid  numbers  at  the  left. 
The  intrinsic  repressor  activity  of  KAP- 1  was  measured  in  a  transient  assay,  using  a  minimal  TK- 
luciferase  reporter  template  regulated  by  5  consensus  GAL4  UAS.  All  experiments  were  done  in 
NIH/3T3  cells  with  0.5  pg  of  reporter  plasmid  and  5  pg  of  the  indicated  heterologous  GAL4- 
KAP-1  construct.  Fold  repression  represents  the  ratio  of  luciferase  activity  measured  for  the 
reporter  alone  to  the  activity  measured  in  the  presence  of  the  indicated  effector  proteins  after 
normalization  for  transfection  efficiency.  Error  bars  represent  the  standard  deviation  for  three 
independent  transfections.  Absence  of  error  bars  indicates  a  standard  deviation  to  little  to 
physically  illustrate. 

Figure  2.  Mutations  in  the  PHD  finger  and  bromodomain  significantly  impair  the  intrinsic 
repression  activity  of  this  integrated  transcriptional  repression  domain.  (A)  Amino  acid  sequence 
alignment  of  the  KAP-1  PHD  finger  with  related  sequences  from  13  independent  proteins.  The 
numbers  in  brackets  indicate  the  corresponding  amino  acids  in  each  protein.  Strictly  conserved 
amino  acids  are  shaded  in  black.  Boxed  amino  acids  in  which  the  chemical  nature  of  the  side 
chain  has  been  conserved.  Asterisks  indicate  amino  acids  that  were  mutated  to  match  naturally 
occurring  mutations  in  the  hATRX  protein.  The  pound  (#)  symbol  represents  conserved  amino 
acids  that  were  mutated  .  Dollar  signs  ($)  represent  amino  acids  in  KAP-1  which  were  mutated 
to  match  the  corresponding  amino  acid  in  TIFla/TIFly.  Each  mutation  was  made  in  the  context 
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of  the  GAL4-KAP-1  (619-835)  expression  construct.  (B)  Amino  acid  sequence  alignment  of  the 
KAP-1  bromodomain  with  related  sequences  from  TIFla,  TIFly,  WCRF180.  hGCN5,  and 
pCAF.  Schematic  illustration  of  the  secondary  structure  and  relevant  position  of  each  structural 
element  in  the  bromodomain  sequence  is  indicated  above.  Amino  acid  residues  that  have  been 
conserved  or  where  the  chemical  nature  of  the  side  chain  has  been  maintained  are  shaded  in 
black.  Bolded  amino  acids  in  the  pCAF  sequence  represent  amino  acids  mutated  in  the  study  by 
Dhalliun  et  al.  Asterisks  mark  the  amino  acids  that  were  mutated  in  this  study.  The  vertical  line 
identifies  the  position  of  the  781trunc  mutation.  Each  mutation  was  made  in  the  context  of  the 
GAL4-KAP-1  (619-835)  expression  construct.  (C)  All  experiments  were  done  in  NIH/3T3  cells, 
as  described  in  Figure  1.  Black  bars  represent  PHD  finger  mutations;  Grey  bars  represent 
bromodomain  mutations.  Stable  expression  of  each  protein  was  determined  via  transfection  into 
COS1  cells  followed  by  immunoprecipitation  of  35S-met  labeled  whole  cell  extracts  with  anti- 
GAL4  (DBD)  antiserum  (1  (Xg). 

Figure  3.  Heterologous  chimeric  fusions  of  related  PHD  fingers  or  bromodomains  to  KAP- 1 . 

(A)  Schematic  illustration  of  chimeric  fusion  proteins  engineered.  The  chimeric  fusion  proteins 
possess  the  following  amino  acid  sequences  of  the  individual  proteins  respectively:  TIFla/KAP- 
1(815-876/680-835),  TIFly/KAP-l(880-941/680-835),  ATRX/KAP-l(210-283/680-835),  Mi- 
2oc/KAP-l  (372-423/680-835),  WCRF180/KAP-1  (1 141-1205/680  835),  KAP-l/TIFla(619- 
679/877-1016),  KAP-1 /TIFly  (619-679/942-1 120),  KAP-1/GCN5  (619-679/714-837),  and  KAP- 
1/WCRF180  (619-679/1389-1556).  All  experiments  were  done  in  NIH/3T3  cells,  as  described  in 
Figure  1.  White  bars  represent  natural  PHD  finger/bromodomain  fusions;  Black  bars  represent 
chimeras  with  the  KAP-1  bromodomain;  Grey  bars  represent  chimeras  with  the  KAP-1  PHD 
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finger.  (B)  Each  plasmid  was  tested  for  stable  protein  expression  via  transfection  into  COS  1 
cells  followed  by  immunoprecipitation  of  :'5S-met  labeled  whole  cell  extracts  with  anti-GAL4 
(DBD)  antiserum  (1  jig).  Arrow  represents  the  migration  of  the  heterologous  GAL4-WCRF180 
PHD  finger/bromodomain.  The  asterisk  represents  a  non-specific  band  retained  during  the 
immunoprecipitation. 

Figure  4.  The  PHD  finger  and  bromodomain  of  KAP-1  specifically  interact  with  the  COOH- 
terminal  amino  acids  of  Mi-2a/CHD3.  (A)  The  yeast  strain  L40  cotransformed  with  the 
indicated  KAP-1  mutants  as  LEXA  DNA  binding  domain  fusions  and  the  rescued  Mi- 
2a-ct:GAL4  activation  domain  clone.  Positive  interaction  is  inferred  by  blue  yeast  due  to  the 
activation  of  the  integrated  LEXA  responsive  LacZ  reporter  gene  and  subsequent  hydrolysis  of 
the  synthetic  substrate  X-gal.  (B)  The  interaction  between  KAP-1  and  Mi-2  is  specific  for  the  C- 
terminus  of  the  alpha  isoform.  Mi-2a/CHD3  is  a  member  of  the  CHD  family  of  proteins  which 
are  characterized  by  the  seven  signature  motifs  of  a  "DEAH"  box  ATPase/Helicase.  Other 
signature  motifs  include  two  PHD  fingers  (light  grey  hexagons),  two  chromodomains  (hatched 
circles),  and  a  putative  DNA  binding  domain  (stripped  bar).  The  putative  KAP-1  interaction 
domain  (KID)  is  indicated.  In  frame  fusion  between  the  COOH-terminal  sequences  of  hMi- 
2a  (accesion  number  3298562),  hCHD3  (accession  number  2645433),  hMi-2(3/CHD4 
(accession  number  1 107696)  were  designed  with  the  GAL4  activation  domain.  Numbers 
represent  the  corresponding  amino  acid  in  the  respective  sequences.  L40  yeast  cotransformed 
with  the  indicated  GAL4  activation  domain  fusion  proteins  and  the  wild  type  LEXA  KAP-1 
PHD  finger  and  bromodomain  illustrated  the  specificity  of  this  interaction  for  Mi-2a. 
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Figure  5.  Role  of  Mi-2a  and  histone  deacetylase  in  KAP- 1  repression.  (A)  Expression  of  the 
KAP-1  interaction  domain  of  Mi-2a  (amino  acids  1738  to  2000)  dominantly  inhibits  KRAB- 
mediated  repression.  Schematic  illustration  of  the  expression  plasmids  and  luciferase  reporter 
used  in  transient  transcription  assays  are  displayed  .  Black  triangles  represent  titrating  amounts 
(1  pg,  2  pg,  and  4  pg)  of  plasmid  DNA.  Expression  of  protein's  was  confirmed  in  COS1  cells 
(data  not  shown).  Fold  repression  was  calculated  as  described  in  Figure  1.  (B)  Addition  of  the 
histone  deacetylase  inhibitor,  TSA,  partially  reverses  the  repression  activity  of  the  KAP-1  PHD 
finger/bromodomain.  NIH/3T3  cells  were  transiently  co-transfected  with  0.5  pg  of  the  indicated 
reporter  plasmid  and  5  pg  of  the  heterologous  GAL4  expression  plasmids:  KAP-1  (619  to  835), 
SAP30,  BCL6-POZ.  Twenty-four  hours  post-transfection,  the  cells  were  treated  with  300  nM 
TSA  (Wako)  for  an  additional  24  hours  prior  to  harvesting.  Fold  repression  was  calculated  as 
described  in  Figure  1.  Fold  repression  in  the  absence  of  TSA  (-TSA,  stippled  bars);  fold 
repression  in  the  presence  of  TSA  (+TSA,  Black  bars). 
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Abstract 


PHD  domains  are  found  in  over  400  eukaryotic  proteins,  many  of  which  are 
transcriptional  regulators.  Naturally  occurring  point  mutations  or  deletion  of  this  domain 
contribute  to  a  variety  of  human  diseases  including  ATRX  syndrome,  myeloid  leukemias  and 
autoimmune  dysfunction.  Here  we  report  the  first  structural  characterization  of  a  PHD  domain. 
Our  studies  reveal  that  the  PHD  domain  from  KAP-1  corepressor  binds  zinc  in  a  cross-brace 
topology  between  anti-parallel  P-strands  reminiscent  of  RING  domains.  By  a  mutational 
analysis,  we  define  the  structural  features  required  for  transcriptional  repression  by  KAP-1  and 
explain  naturally  occurring,  disease-causing  mutations  in  PHD  domains  of  other  proteins.  From 
a  comparison  of  the  PHD  structure  with  previously  reported  RING  and  LIM  structures,  we  infer 
sequence  determinants  which  allow  discrimination  among  PHD,  RING,  and  LIM  motifs. 

(We  are  in  the  process  of  submitting  coordinates  to  the  PDB) 


Keywords:  KAP-1 /LIM/PHD/RING/zinc-fmger 
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Introduction 


Biological  processes  depend  on  the  spatial  and  temporal  targeting  of  specific 
macromolecular  interactions.  The  presence  of  highly  conserved  amino  acid  sequence  motifs  and 
their  structural  arrangement  within  a  novel  protein  can  provide  the  first  clue  to  the  protein’s 
function.  Modular  motifs  within  multi-domain  proteins  often  specify  interacting  partners,  and  the 
identification  of  these  interactions  has  been  beneficial  in  defining  biochemical  functions  of 
particular  proteins  and  their  cellular  pathways.  Furthermore,  these  modular  domains  are  often 
targets  of  mutations  that  disrupt  the  cellular  pathways  in  which  they  function.  Identification  of 
modular  domain  type  through  sequence  homology  studies  alone  can  lead  to  ambiguous  domain 
type  assignment,  making  prediction  of  function  a  difficult  enterprise.  Thus,  biophysical 
characterization  and  determination  of  the  3D  structure  are  essential  for  understanding  cellular 
and  biological  function. 

The  plant  homeodomain  (PHD)  or  leukaemia  associated  protein  (LAP)  domain  is  a 
relatively  small  motif  of  approximately  60  amino  acids  that  is  found  in  over  400  eukaryotic 
proteins,  many  of  which  are  believed  to  be  involved  in  the  regulation  of  gene  expression, 
including  the  KAP-l/TIFip,  WCRF/WSTF,  Mi-2  and  CBP/p300  families  (Aasland  et  al,  1995). 
It  has  been  suggested  that  this  domain  is  involved  in  protein-protein  interactions  related  to  a 
possible  role  in  chromatin-mediated  regulation  of  gene  expression  (Jacobson  and  Pillus,  1999). 
Although  no  direct  role  in  transcriptional  processes  has  been  reported  for  this  domain,  recent 
reports  indicate  that  the  extended  PHD  domain  of  AF10  is  necessary  for  homo-oligomerization, 
an  event  required  for  the  ability  of  the  AT  hook  motif  in  AF10  to  bind  DNA  (Linder  et  al., 

2000). 
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The  biological  importance  of  the  PHD  domain  is  underscored  by  its  involvement  in  the 
pathogenesis  of  several  human  disorders.  Clinically  relevant  missense  mutations  in  the  PHD 
domain  of  the  ATRX  protein  result  in  a-thalasemia  and  mental  retardation  (Gibbons  et  al.,  1997; 
Rinderle  et  al.,  1999).  Germline  nonsense  mutations  in  the  AIRE  gene  result  in  truncated  proteins 
where  one  or  both  of  the  PHD  domains  are  deleted  in  patients  with  autoimmune  polyglandular 
syndrome  type  1  (APECED)  (The  Finnish-German  APECED  Consortium,  1997).  Somatically 
acquired  point  mutations  in  the  PHD  domain  of  ING1  have  been  identified  in  head  and  neck 
squamous  cell  carcinomas  (Gunduz  et  al.,  2000).  Chromosomal  rearrangements  that  delete  this 
motif  in  proteins  such  as  MLL,  CBP,  MOZ  and  AF10  result  in  myeloid  leukemias  (Jacobson  and 
Pillus,  1999).  Further,  genes  encoding  PHD  domain  proteins  have  been  identified  in  the  critical 
deletion  regions  of  several  contiguous  gene  deletion  syndrome  such  as  Williams  syndrome 
(WSTF)  and  the  immunodeficiency  syndrome  ICF  (DMNT3B)  (Aapola  et  al.,  2000;  Lu  et  al., 
1998).  The  prevalence  of  disease-causing  mutations  in  PHD  domains  suggests  that  they  play  a 
basic  and  essential  role  in  the  normal  function  of  human  cells. 

The  PHD  domain  is  a  motif  characteristically  defined  by  seven  cysteines  and  a  histidine 
that  are  spatially  arranged  in  a  C4HC3  consensus  with  intervening  sequences  of  varying  length 
and  composition  (Aasland  et  al.,  1995).  This  particular  arrangement  of  amino  acids  is  highly 
homologous  to  the  RING  and  to  the  LIM  domains  (Figure  1  A)  (Borden  and  Freemont,  1996; 
Dawid  et  al.,  1994).  In  both  the  RING  and  the  LIM  domain,  these  conserved  cysteine  and 
histidine  residues  are  utilized  to  bind  two  zinc  atoms,  a  process  cooperatively  coupled  to  the 
folding  of  the  domains.  The  LIM  uses  a  sequential  zinc  ligation  scheme,  where  the  first  and 
second  pair  of  metal  ligands  creates  the  first  zinc-binding  site,  while  the  third  and  fourth  pair 
form  the  second  site  (Figure  IB).  In  contrast,  the  RING  domain  utilizes  a  unique  cross-brace 
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ligation  topology,  where  the  first  and  third  pair  of  metal  ligands  form  the  first  zinc-binding  site 
(site  I),  while  the  second  and  fourth  pair  form  the  second  site  (site  II)  (Figure  IB).  The  different 
arrangements  of  metal  ligands  result  in  dramatically  different  three-dimensional  structures 
(Figure  1C).  The  choice  of  ligation  topology  is  undoubtedly  critical  for  proper  folding  and 
domain  functionality.  Given  the  sequence  similarity  amongst  PHD,  RING  and  LIM  domains, 
one  is  unable  to  predict  the  ligation  scheme  and  fold  by  sequence  analysis  alone. 

Here,  we  report  the  first  solution  structure  of  the  PHD  domain  from  the  KAP-1 
corepressor  (also  known  as  TIFip  or  KRIP1),  a  universal  corepressor  for  the  KRAB-zinc  finger 
superfamily  of  transcriptional  repressors  (Ryan  et  al.,  1999).  We  demonstrate  that  this  minimal 
domain  binds  two  zinc  atoms  in  a  cross-brace  RING-like  arrangement.  Not  only  are  the  zinc 
ligation  schemes  identical,  but  the  tertiary  topologies  of  the  PHD  and  RING  domains  are 
remarkably  similar.  From  the  comparison  of  PHD,  RING  and  LIM  domains,  it  is  possible  to 
derive  unique  structural  determinants.  Structure  based  site-directed  mutational  analysis  reveals 
structural  features  of  the  PHD  domain  that  are  critical  for  the  ability  of  KAP-1  to  repress 
transcription.  Furthermore,  we  are  able  to  rationalize  the  structural  consequences  of  mutations  in 
the  PHD  domains  of  other  proteins  such  as  ATRX  and  ING1,  where  these  mutations  are 
correlated  with  human  disease. 
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Results  and  Discussion 


The  PHD  domain  requires  zinc  for  folding 

The  conservation  of  eight  potential  metal  ligands  in  the  PHD  family  suggested  that  it  is  a 
zinc-binding  protein.  In  order  to  analyze  the  chemical  and  physical  characteristics  of  the  PHD 
domain,  we  expressed  the  minimal  KAP-1  PHD  (amino  acids  618-679)  in  Escherichia  coli  and 
purified  the  protein  to  homogeneity  using  affinity  and  size  exclusion  chromatographies. 
Analytical  ultracentrifugation  experiments  suggest  that  greater  than  90%  of  the  PHD  protein  in 
solution  is  a  monomer  (data  not  shown).  To  determine  the  identity  and  stoichiometry  of  metal 
bound  by  the  PHD  domain,  we  used  inductively  coupled  plasma  (ICP)  spectrometry.  These 
measurements  indicate  that  a  3.0  mM  solution  of  KAP-1  PHD  protein  contained  5.2  mM 
(±3.3%)  zinc.  This  ratio  of  1  protein  molecule  to  1.7  zinc  atoms  indicates  that  the  PHD  binds  2 
zinc  atoms  per  protein  molecule,  potentially  utilizing  its  eight  conserved  cysteine  and  histidine 
residues  for  metal  ligation. 

Based  on  this  result,  we  investigated  the  role  of  zinc  in  structure  formation  of  the  PHD 
domain  using  circular  dichroism  (CD)  and  nuclear  magnetic  resonance  (NMR)  spectroscopies. 
The  CD  spectrum  of  KAP-1  PHD  denotes  a  folded  protein  with  substantial  |3-sheet  and  no 
significant  helical  content  (Figure  2A).  Introduction  of  the  metal  chelating  agent  EDTA  results 
in  complete  loss  of  secondary  structure.  Addition  of  ZnCh,  in  excess  of  EDTA,  results  in  re¬ 
folding  of  the  domain  (data  not  shown).  Similarly,  'H/^N-HSQC  NMR  experiments  indicate 
that  the  KAP-1  PHD  domain  requires  zinc  for  proper  folding  (Figures  2B  and  2C).  In  these 
experiments,  the  PHD  domain  was  expressed  in  minimal  media  in  the  presence  or  absence  of 
zinc  (see  Materials  and  methods).  The  spectrum  of  the  sample  expressed  in  the  absence  of  zinc 
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shows  poor  dispersion  of  amide  proton  chemical  shifts,  indicative  of  an  unfolded  protein  (Figure 
2B).  In  contrast,  the  sample  expressed  in  the  presence  of  zinc  produces  a  spectrum  revealing  a 
substantial  increase  in  chemical  shift  dispersion,  a  decrease  in  line  widths,  and  the  appearance  of 
virtually  all  expected  backbone  amide  resonances;  all  hallmarks  of  a  well-folded  protein  (Figure 
2C).  Thus,  like  RING  and  LIM  domains,  the  binding  of  two  zinc  atoms  is  necessary  for  folding 
of  the  PHD  domain. 

The  structure  ofKAP-1  PHD  domain  reveals  a  cross-brace  zinc  ligation  scheme 

We  determined  the  solution  structure  of  the  KAP-1  PHD  domain  using  standard 
homonuclear  and  heteronuclear  NMR  techniques.  Initial  NMR  studies  were  conducted  on  an  N- 
terminal  His-tagged  construct  ofKAP-1  (amino  acids  618-679)  containing  the  entire  PHD 
domain.  A  comparison  of  2D  ’H/15N  HSQC  and  NOESY  spectra  of  tagged  and  untagged 
proteins  revealed  that  the  two  spectra  were  identical  apart  from  resonances  corresponding  to  the 
tag  (data  not  shown).  Thus,  the  presence  of  this  N-terminal  His-tag  had  no  influence  on  the 
structure  of  the  PHD  domain. 

An  initial  set  of  structures  was  generated  without  reference  to  metal  ligation  in  order  to 
determine  the  residues  involved  in  zinc-binding.  These  structures  indicated  a  cross-brace 
ligation  scheme  with  residues  C628,  C631,  H648  and  C651  forming  one  zinc-binding  site  (site  I) 
while  C640,  C643,  C666  and  C669  forming  the  other  (site  II).  These  sites  were  partially  defined 
by  the  individual  NOEs  observed  between  residues  implicated  in  zinc  ligation.  For  instance, 
long-range  NOEs  were  observed  between  C666  and  C640,  C631  and  C651,  C640  and  C669,  in 
addition,  several  NOEs  between  residues  adjacent  to  the  zinc  ligands  were  observed.  Subsequent 
structure  calculations  included  Zn  atoms  and  additional  constraints  defining  tetrahedral 
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coordination.  To  ensure  that  a  cross-brace  was  the  correct  zinc  ligation  scheme,  alternative 
schemes  were  calculated  but  none  satisfied  the  experimental  constraints.  The  final  ensemble  is 
displayed  in  Figure  3  A  and  all  atoms  depiction  in  Figure  3B.  Statistics  of  the  ensemble  are  given 
in  Table  I. 

The  structure  of  KAP-1  PHD  domain  reveals  a  globular  domain  binding  two  Zn  atoms  in 
a  distinguishing  cross-brace  fashion  (Figure  3).  In  this  arrangement  residues  C628,  C631,  H648 
and  C651  form  a  single  zinc-binding  site  (site  I),  while  C640,  C643,  C666  and  C669  form  the 
second  site  (site  II)  (refer  to  Figure  3E).  The  formation  of  site  I  begins  with  an  eight  residue  loop 
(residues  627-634)  which  continues  into  an  extended  region  (residues  635-636)  followed  by  the 
first  (3-strand  (Pi;  residues  637-640).  Site  II  is  then  created  by  a  turn  (residues  641-644)  that 
ends  into  the  second  P-strand  (P2;  residues  645-647)  running  anti-parallel  to  Pi.  The  first  zinc¬ 
binding  site  is  completed  by  another  loop  (residues  648-652).  This  portion  is  followed  by  an 
extended  flexible  region,  (residues  653-663;  shown  in  blue,  Figure  3A)  demarcated  by  two 
proline  residues  (P654  and  P660)  which  appear  to  act  as  a  hinge  relative  to  the  rest  of  the  domain 
(Figure  3A).  This  flexible  region  is  followed  by  a  third  p-strand  (P3;  residues  664-666)  leading 
to  the  completion  of  the  second  zinc-binding  site  (residues  666-670).  Numerous  hydrophobic 
interactions  appear  to  stabilize  this  structure.  F647  is  at  the  centre  of  the  hydrophobic  core 
packing  against  W664,  V638,  L656,  and  H652.  Interestingly,  the  proton  resonances  for  the 
aromatic  ring  of  F647  are  not  degenerate  indicating  that  the  phenyl  group  is  not  free  to  rotate  in 
the  core  suggesting  that  the  core  is  tightly  packed.  Analysis  of  the  electrostatic  surface  potential 
reveals  two  areas  of  discrete  charge  (Nicholls  et  al.,  1991).  Most  notably,  the  flexible  hinge 
(residues  653-663)  is  lined  with  several  Glu  and  Asp  residues  contributing  to  a  negatively 
charged  surface  and  the  presence  of  Arg629  and  Lys633  results  in  a  positively  charged  cluster 
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around  site  I.  Ongoing  mutational  analyses  will  be  utilized  to  determine  the  importance  of  these 
features  to  KAP-1  PHD  function.  In  terms  of  charge  and  topology,  no  structural  features  typical 
of  nucleic  acid  binding  proteins  are  observed,  such  as  those  present  in  the  Kruppel-like  or 
GATA-like  zinc  fingers  (Klevit  et  al.,  1991;  Omichinski  et  al.,  1993).  This  suggests  that  the 
PHD  is  not  a  nucleic  acid  binding  domain. 

The  PHD  domain  structurally  resembles  a  RING 

Unlike  the  sequential  zinc  ligation  pattern  used  by  the  LIM  domains,  the  KAP-1  PHD 
domain  structure  shows  a  zinc  ligation  pattern  similar  to  RING  domains.  Therefore,  we  carried 
out  a  detailed  comparison  between  the  PHD  domain  and  two  previously  reported  RING  domain 
structures,  from  the  promyelocytic  leukemia  protein  PML  and  the  immediate  early  equine  herpes 
virus  protein  IEEHV  (Protein  Data  Bank  codes  lbor  and  lchc,  respectively;  (Barlow  et  al.,  1994; 
Borden  et  al.,  1995a)).  These  proteins  have  no  sequence  homology  outside  of  conserved  metal 
binding  residues.  Strikingly,  in  all  three  cases  the  inter-zinc  distance  is  ~14A,  presumably 
because  the  central  p-strand  of  these  molecules  is  the  same  length  (|32  in  KAP-1  PHD)  and  all  use 
the  cross-brace  ligation  scheme.  Because  PHD  and  RING  domains  use  different  permutations  of 
cysteines  and  histidines  for  metal  ligation,  for  clarity  we  denote  the  zth  conserved  metal-ligating 
residue  along  the  primary  sequence  by,  m/,-.  Notably,  a  conserved  hydrophobic  core  residue 
(F647)  is  located  in  (32,  just  N-terminal  to  ml5.  Inspection  of  these  structures  indicates  that  the 
first  zinc-binding  site  and  anti-parallel  (3-strands  ((3i|32)  of  PHD  overlay  strikingly  well  with  the 
two  RING  structures  (Figure  4).  An  a-carbon  backbone  alignment  of  RING  and  KAP-1  PHD 
structures  for  site  I  overlay  nearly  exactly,  as  do  the  zinc  atoms.  The  root  mean  square 
difference  (RMSD)  of  the  a  -carbon  backbone  for  residues  around  site  I  is  1.4A  when  compared 
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to  PML  and  1.3 A  when  compared  to  IEEHV  (Figure  4B).  Both  PML  and  IEEHV  RINGs  have  a 
large  insertion  along  the  Pi  strand  relative  to  KAP-1  PHD  (Figure  5).  Comparison  of  residues 
just  N-terminal  to  mlj  through  to  just  C-terminal  to  ml  6,  which  includes  residues  in  the 
hydrophobic  core,  results  in  RMSDs  between  KAP-1  and  PML  of  2.0A  and  between  KAP-1  and 
IEEHV  of  2.1  A.  The  sequence  identity  in  this  region  is  20%  between  PML  and  KAP-1  and  22% 
between  IEEHV  and  KAP-1.  Normally,  RMSD  values  of  ~2A  are  expected  for  proteins  with 
sequence  identities  greater  than  30%  (Flores  et  al,  1993).  Thus,  these  levels  of  similarity  are 
striking  considering  there  is  virtually  no  sequence  conservation  between  these  domains  outside 
of  their  conserved  zinc  ligands. 

Like  the  PHD  family,  the  RING  family  shows  substantial  structural  plasticity  between 
ligands  m/6  and  m/7  where  the  chain  completes  site  I  and  continues  on  towards  site  II.  In  PHD 
domains,  the  spacing  varies  from  12  to  46  residues,  and  in  RING  domains,  the  spacing  varies 
from  4  to  48  residues.  The  RINGs  of  IEEHV,  RAG1,  and  BRCA1  have  large  inserts  in  this 
region,  forming  a  helix.  However,  RINGs  with  shorter  inserts,  e.g.  PML  and  BARD1,  do  not 
have  a  helix  in  this  region  (information  for  BRCA1  and  BARD1,  personal  communication,  P. 
Brozvic  and  R.  Klevit;  reviewed  in  Borden,  1998).  In  contrast,  this  region  in  the  KAP-1  PHD 
contains  the  flexible  hinge  (residues  653-663).  Although  this  insert  is  comparable  in  size  to  that 
of  RINGs  containing  a  helix,  the  propensity  for  this  region  to  adopt  a  helical  conformation  is 
highly  unlikely  due  to  the  presence  of  two  proline  residues  (P654  and  P660). 

A  major  structural  difference  between  PHD  and  RING  domains  is  found  in  the 
hydrophobic  core.  Residues  required  for  the  hydrophobic  core  of  the  RING  structures  are  N- 
terminal  to  m/5  and  C-terminal  to  mh  (L76  and  L81  in  PML,  F28  and  133  in  IEEHV,  see  also 
Figures  4C  and  5)  (Barlow  et  al.,  1994;  Borden,  1998;  Borden  et  al,  1995a).  Conserved 
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hydrophobic  residues  at  these  exact  positions  are  also  found  in  the  PHD  (F647,  N-terminal  to 
ml5;  and  H652,  C-terminal  to  ml6).  However,  there  exists  an  additional  conserved  hydrophobic 
residue  in  PHD,  a  tryptophan  two  residues  N-terminal  to  mh  (W664  in  KAP-1;  Figures  4C  and 
5).  This  tryptophan,  which  is  highly  conserved  throughout  the  PHD  family,  inserts  between  the 
core  residues  N-terminal  to  mh  and  C-terminal  to  mh.  In  RINGs,  no  hydrophobic  core  residue 
exists  at  the  position  two  residues  before  mh  (Figures  4C  and  5).  Rather,  in  all  RING  structures 
reported,  the  amino  acid  at  this  position  is  actually  solvent  exposed  (reviewed  in  Borden,  1998). 
This  additional  core  residue  appears  specific  to  the  PHD  family.  Because  of  the  presence  of 
W664,  the  side-chains  of  F647  and  H652  are  oriented  differently  relative  to  the  RING,  although 
the  trace  of  the  main-chain  is  virtually  unaltered  relative  to  the  RING  fold  (Figure  4C).  Ligands 
mh  and  ml \  in  site  II  are  positioned  nearly  identically  in  both  PHDs  and  RINGs. 

Aside  from  differences  in  the  hydrophobic  core,  there  are  significant  differences  in  the 
main-chain  trace  between  ml 6  and  mh  in  PHD  compared  to  RING  domains.  At  H652,  just  C- 
terminal  to  mh,  the  main-chain  trace  significantly  deviates  from  RING  domains.  This  difference 
appears  to  result  from  the  participation  of  W664  (two  residues  N-terminal  to  mh)  in  the 
hydrophobic  core.  This  alters  the  position  of  the  mh  and  mh  relative  to  the  RINGs.  These 
differences  suggest  that  the  PHD  domain,  although  structurally  similar  to  the  RING  in  part,  could 
bind  structurally  distinct  partners  and  display  unique  biological  activities.  We  expect  that 
insertion  size  and  sequence  variability  between  mlj  to  mh  and  mh  to  mh  results  in  substantial 
structural  plasticity  in  PHD  domains,  imparting  specificity  onto  a  common  structural  scaffold 
stabilized  by  zinc. 

Sequence  Determinants  for  RING,  LIM  and  PHD  domains 
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The  structural  similarity  between  PHD  and  RING  domains,  coupled  to  the  dissimilarity 
with  LIM  domains,  prompted  us  to  examine  whether  there  were  any  structure-based  sequence- 
determinants  which  would  enable  one  to  predict  if  a  given  amino  acid  sequence  is  PHD,  RING 
or  LIM-like  in  structure.  The  major  structural  determinant  appears  to  be  based  on  the  zinc 
ligation  scheme  utilized.  Sequence  alignments  for  selected  members  of  the  PHD,  RING,  and 
LIM  families  are  given  in  Figure  5.  For  RING  and  LIM  domains,  conserved  hydrophobic  core 
residues  determined  through  inspection  of  the  structures  are  shown.  In  these  cases,  residues 
which  form  the  hydrophobic  core  are  shown  in  yellow,  site  I  in  magenta,  and  site  II  in  blue. 

These  alignments  suggest  that  proteins  with  conserved  hydrophobic  residues  at  positions  N- 
terminal  to  ml 5  and  C-terminal  to  ml 6  utilize  a  cross-brace  ligation  scheme  as  in  the  PHD/RING 
structures  whereas  proteins  with  hydrophobic  residues  N-terminal  to  ml 3  and  C-terminal  to  ml 4 
utilize  a  sequential  LIM-like  ligation  scheme.  Alignments  indicate  that  of  the  PHD/RING  cross¬ 
brace  schemes,  the  presence  of  a  tryptophan  residue  two  residues  N-terminal  to  mlj  may  be 
diagnostic  of  a  PHD  fold.  In  contrast,  LIMs  have  additional  conserved  hydrophobic  core 
residues  which  are  not  found  in  PHD  or  RING  (Figure  5).  Furthermore,  residue  spacing  in  the 
LIM  domain  is  much  more  highly  conserved  than  for  PHDs  or  RINGs  (Figure  1  A). 

For  PHD,  RING,  and  LIM,  flexibility  in  ligating  residue  type  has  been  observed,  e.g.  His 
can  replace  Cys  and  even  Asp  can  be  used  as  a  ligand.  Thus  the  position  of  specific  types  of  zinc 
ligands  is  unlikely  to  be  a  primary  determinant  of  the  zinc  ligation  pattern  and  therefore  the  fold 
of  the  molecule.  To  test  this  hypothesis,  we  swapped  the  type  of  zinc  ligand,  e.g.  His  for  Cys,  at 
ligand  position  ml5,  and  determined  the  ability  of  KAP-1  to  act  as  a  transcriptional  repressor 
(Figure  6B,  Table  II).  This  mutation  has  little  effect  indicating  that  the  type  of  ligating  residue, 
at  least  at  this  position,  is  not  crucial.  However,  substitution  of  Ala  for  His  at  ml5  completely 
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destroys  the  transcriptional  repression  properties  of  the  KAP-1  PHD  domain  (Figure  6B,  Table 
II). 

Structural  integrity  of  the  PHD  domain  is  essential  for  KAP-1  to  repress  transcription 

In  order  to  define  structural  features  of  the  PHD  domain  important  for  the  transcriptional 
repression  by  KAP-1,  mutational  analysis  was  carried  out  on  a  GAL4-KAP-1  (619-835) 
expression  construct  (Figure  6A).  This  construct  encodes  both  the  PHD  domain  and  the 
Bromodomain  of  KAP-1  which  together  comprise  an  independent  repression  domain  (Figure  6 
panels  B  and  C,  Table  II,  and  Schultz  et  al.,  submitted).  Mutations  of  zinc  ligands  in  either  site  I 
(CC628,  631AA  or  H648A)  or  site  II  (C643A  and  C666A)  abolished  transcriptional  repression 
(Figure  6,  Table  II),  while  mutation  of  the  non-zinc  ligating  cysteine  (C646V)  shows  no  effect. 

A  mutation  which  potentially  introduces  an  extra  zinc  ligand  (M639C)  results  in  a  lowered 
activity,  presumably  because  this  ‘new’  ligand  may  disrupt  the  formation  of  the  cross-brace  by 
competition  for  zinc.  Further,  mutations  that  disrupt  the  hydrophobic  core  of  the  molecule, 
W664A,  are  as  debilitating  as  mutations  that  disrupt  zinc  ligation.  Consistently,  conservative 
non-disruptive  mutations  in  the  core,  F647Y,  do  not  alter  transcriptional  repression  by  KAP-1 . 
Thus,  the  ability  of  the  KAP-1  PHD  domain  to  bind  zinc  and  maintain  the  hydrophobic  core  are 
both  required  for  its  proper  folding  and  ability  to  repress  transcription. 

The  potential  structural  consequences  of  PHD  domain  mutations  in  other  proteins 

Naturally  occurring  mutations  and  deletions  in  the  PHD  domain  of  several  proteins 
exemplify  the  biological  importance  of  this  domain.  Germline  missense  mutations  in  the  PHD 
domain  of  the  ATRX  protein  predispose  individuals  to  a-thalasemia  and  mental  retardation 
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(Gibbons  et  al.,  1997).  Somatic  missense  mutations  found  in  the  PHD  domain  of  ING1  are 
detected  in  several  types  of  carcinomas  (Gunduz  et  al.,  2000).  Analysis  of  these  mutations  in 
light  of  the  KAP-1  PHD  structure  reveal  the  basis  for  their  devastating  nature  (Table  II).  In  the 
ING1  PHD  domain,  two  mutations  of  site  I  are  found  in  squamous  cell  carcinomas  (Gunduz  et 
al.,  2000).  C63 1 S  (where  this  amino  acids  are  numbered  in  the  context  of  the  KAP-1  PHD),  like 
previous  zinc-ligand  mutations,  should  destroy  zinc-binding  and  structure  formation.  N632S, 
adjacent  to  ml2,  may  indirectly  interfere  with  zinc-binding  and  therefore,  formation  of  site  I. 

Two  mutations  in  ATRX  (C628R  and  C651F)  disrupt  the  ability  of  this  domain  to  ligate  zinc. 
These  mutations  are  expected  to  result  in  an  unfolded  protein,  and  equivalent  mutations  in  KAP- 
1  PHD  domain  results  in  a  loss  of  repression  (Table  II).  Another  mutation  in  ATRX  introduces 
an  additional  potential  metal-ligand  (P654C)  which  may  disrupt  biological  activity  through 
misfolding.  Some  surface  mutations  (V630S  and  Q657E)  have  no  effect  on  transcriptional 
activity  in  these  assays;  however,  these  mutations  are  found  in  ATRX  patients.  Thus,  these 
residues  may  be  required  by  the  ATRX  protein  for  mediating  protein-protein  interactions.  An 
interesting  feature  of  the  wild-type  ATRX  PHD  domain  is  that  it  is  a  PHD  domain  with  eight 
cysteines  (His  at  ml5  is  replaced  by  Cys).  Mutation  of  this  ligand  in  KAP-1  (H648C)  resulted  in 
only  a  slight  loss  of  transcriptional  activity  (Figure  6B)  suggesting  that  either  a  histidine  or  a 
cysteine  can  be  tolerated  as  a  metal-ligand  at  this  position. 

RING-like  biological  activities  of  the  PHD  domain 

Since  KAP-1  PHD  has  many  "RING-like"  features  including  the  cross-brace  zinc  ligation 
scheme  and  a  high  level  of  structural  similarity  to  the  anti-parallel  P-strands  and  zinc-binding  site 
I,  we  conducted  studies  to  determine  whether  this  domain  had  any  RING-like  biological  activity. 
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Features  characteristic  of  RINGs  include:  (I)  requirement  of  zinc  for  folding  and  biological 
activity  and  (II)  formation  of  large  multi-protein  complexes  by  acting  as  scaffolds  for  protein- 
protein  interactions  (Borden,  2000;  Kentsis  and  Borden,  2000).  Like  RING,  we  show  that  KAP-1 
PHD  requires  zinc  for  folding  and  function.  KAP-1  PHD  participates  in  protein  protein 
interactions  with  Mi-2a  (Schultz  et  al.,  submitted),  a  component  of  the  NuRD  histone 
deacetylase  complex  (Brehm  et  al,  1999).  Therefore,  KAP-1  PHD  may  be  important  for  proper 
spatial  and  temporal  scaffolding  of  the  repressor  complex.  This  association  is  likely  to  be  critical 
for  transcriptional  repression  by  KAP-1.  Consistent  with  our  results,  recent  work  indicates  that 
the  AIRE  PHD  domain  is  required  for  formation  of  nuclear  protein  complexes  (Rinderle  et  al., 
1999).  Further,  the  AF10  PHD  domain  is  reported  to  be  a  homo-oligomerization  module  (Linder 
et  al.,  2000).  Like  RINGs,  the  PHD  domain  appears  to  function  as  a  protein  protein  interaction 
motif.  Presumably,  disruption  of  the  structural  integrity  of  PHD  domain  by  mutation  of  critical 
residues  fundamentally  disrupts  this  action. 

Recent  studies  have  shown  that  some  RING  domains  function  as  E3  ubiquitin  ligases 
(reviewed  in  (Borden,  2000)).  E3  ubiquitin  ligases  are  components  of  the  ubiquitin  degradation 
pathway.  Recent  structural  data  indicate  that  a  helix  between  ml6  and  ml?  is  involved  in  making 
the  major  contacts  between  the  Cbl  RING  and  the  ubiquitin  machinery  (Zheng  et  al.,  2000). 
Because  of  the  similarity  to  the  RING,  we  tested  KAP-1  PHD  in  this  system.  However,  the  PHD 
domain  has  no  detectable  E3  ligase  activity  (data  not  shown).  This  result  is  consistent  with  our 
structural  data  which  indicate  that  there  is  no  helix  present  between  ml6  and  ml7.  It  will  be 
necessary  to  test  several  other  PHD  domains  before  this  activity  can  be  ruled  out  for  the  family 
as  a  whole. 
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Conclusions 

We  present  the  first  structure  of  a  PHD  domain  and  show  it  is  an  autonomously 
structured  domain  that  requires  zinc  for  folding.  From  sequence  analysis  alone,  it  is  impossible 
to  determine  how  the  eight  conserved  metal  ligands  are  utilized  to  bind  zinc.  This  uncertainty  is 
evident  from  a  recently  modeled  structure  of  the  DNMT3L  PHD  domain  where  the  ligation 
scheme  was  assumed  to  be  LIM-like  (Aapola  et  al.,  2000).  The  actual  ligation  of  zinc  by  the 
PHD  domain  is  accomplished  by  a  cross-brace  motif,  like  that  used  by  the  RING  family.  Not 
only  are  the  ligation  schemes  similar  between  the  two  domains,  but  the  structures  are 
topologically  identical  from  ml\  to  ml 6,  which  includes  the  first  zinc-binding  site  and  the  anti¬ 
parallel  (3-strands.  We  identify  structural  determinants  that  distinguish  ligation  schemes  of 
RING,  LIM  and  PHD  domains.  In  the  structure  of  the  KAP-1  PHD  domain,  there  exists  a 
flexible  hinge  region  between  ml6  and  ml7  characterized  by  a  discrete  negatively  charged  surface. 
Amongst  the  RING  domains,  this  same  region  shows  structural  variability,  forming  an  extended 
strand  or  a  helix.  The  plasticity  of  this  region  may  underlie  the  biological  specificity  of  these 
rather  similar  domains.  The  PHD  domain  is  required  for  transcriptional  repression  by  KAP-1  and 
we  present  structural  determinants  required  for  this  activity,  which  elucidate  the  structural  basis 
of  disease-causing  missense  mutations  in  ATRX  and  ING1. 

Materials  and  methods 
Preparation  of  PHD  domain 

A  DNA  fragment  encoding  the  PHD  domain  of  human  KAP-1  (amino  acids  618-679)  was 
subcloned  into  the  pQE30  expression  vector  (Qiagen),  and  expressed  in  E.  coli  BL21(DE3)  cells 
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(Novagen).  The  His-tagged  protein  contained  an  additional  17  N-terminal  amino  acids, 
MRGSHHHHHHGSDIIDE,  amino  acids  618-679  of  hKAP-1,  and  9  C-terminal  amino  acids, 
VDLQACKLN.  For  the  untagged  KAP-1  PHD  domain  protein,  a  DNA  fragment  encoding 
amino  acids  618-679  was  subcloned  into  the  expression  vector  pQE50  (Qiagen),  and  expressed 
in  E.  coli  SGI 3009  cells  (Qiagen).  The  expressed  protein  contained  4  NH2-terminal  amino 
acids,  MRGS,  followed  by  amino  acids  618  to  679  of  hKAP-1.  His-tagged  or  untagged  protein 
was  prepared  from  logarithmically  growing  bacteria  cultured  in  either  2YT  media  or  in  minimal 
media,  containing  15NH4C1  as  the  sole  nitrogen  source  and  supplemented  with  100  gM  zinc 
acetate  (for  uniformly  labeled  15N-protein).  In  both  types  of  media,  protein  expression  was 
induced  with  1  mM  IPTG  for  3  hours  at  37°C.  Induced  bacterial  pellets  were  lysed  in  20  mM 
NaH2P04,  pH  7.5,  500  mM  NaCl,  5mM  DTT.  The  His-tag  PHD  domain  was  purified  by  affinity 
chromatography  on  a  Ni2+-NTA-agarose  column  (Qiagen),  followed  by  fractionation  through  a 
Superdex  75  size  exclusion  column  (Pharmacia).  The  untagged  PHD  domain  was  purified 
sequentially  by  ion  exchange  chromatography  (DEAE-cellulose),  hydrophobic  interaction 
chromatography  (Butyl-sepharose),  and  size  exclusion  chromatography  (Superdex  75).  NMR 
experiments  utilized  both  constructs,  while  metal  binding  analysis  (i.e.  ICP  and  CD)  used  only 
the  untagged  PHD  construct. 

Site-directed  mutagenesis 

Site-directed  point  mutations  in  the  KAP-1  PHD  domain  were  engineered  by  standard  overlap 
extension  PCR-mediated  mutagenesis  procedures.  The  mutagenic  primers  for  the  described 
mutations  (Figure  6)  contained  the  following  codons:  CC628,63 1 AA,  TGC  to  GC;  C628R,  TGC 
to  CGC;  V630S,  GTC  to  AGC;  G635A,  GGC  to  GCC;  D636A,  GAT  to  GCT;  M639C,  ATG  to 
TGC;  C643A,  TGT  to  GCT;  C646V,  TGT  to  GTT;  F647Y,  TTC  to  TAC;  H648C,  CAC  to  TGC; 


17 


C651F,  TGT  to  TTT;  P654C,  CCG  to  TGC;  Q657E,  CAG  to  GAG;  P660A,  CCA  to  GCA; 
W664A,  TGG  to  GCG;  C666A,  TGC  to  GCC;  SL667,668TF,  TCA  and  CTC  to  ACA  and  TTC. 
Appropriate  reading  frame  fusions  and  integrity  of  flanking  sequences  for  all  constructs  created 
by  PCR  was  confirmed  by  DNA  sequence  analysis  of  both  strands. 

Transient  transfections 

Protein  expression  from  all  plasmids  was  confirmed  by  transient  transfection  of  COS-1  cells 
followed  by  immunoprecipitation  of  35S-methionine  labeled  cell  extracts.  All  transcription  assay 
transfections  were  done  as  previously  described  (Ryan  et  al.,  1999). 

Inductively  coupled  plasma  spectrometry 

Inductively  coupled  plasma  spectrometry  was  conducted  using  the  untagged  PHD  sample  at 
Galbraith  Laboratories,  Inc.  (Knoxville,  TN,  USA).  25  pi  of  a  3.0  mM  PHD  sample  (20  mM 
NaH2P04,  500  mM  NaCl,  5mM  DTT,  pH  7.5)  was  prepared  for  ICP  analysis  using  a  wet  ash 
digestion  procedure  previously  described  (Bock,  1979).  Measurements  were  made  on  a  Perkin- 
Elmer  P2000  using  a  primary  wavelength  of  213.856  nm  (Wallace,  1981). 

Circular  dichroism  spectroscopy 

Spectra  were  collected  on  a  Jasco  810  spectropolarimeter  at  25°  using  a  1  mm  path-length 
cuvette.  The  concentration  of  the  untagged  PHD  sample  was  30  pM  in  20  mM  NaH2P04  and  50 
mM  NaCl,  pH  7.5.  After  an  initial  spectrum  was  collected,  a  6-fold  molar  excess  of  EDTA  was 
added  and  pH  was  adjusted  to  pH  7.5.  The  sample  was  then  left  at  room  temperature  for  16 
hours  to  ensure  it  had  reached  equilibrium.  Spectra  were  then  recorded. 

NMR  spectroscopy 

For  NMR,  KAP-1  PHD  domain  was  1. 5-3.0  mM  in  20  mM  NaH2P04,  500  mM  NaCl,  5  mM 
DTT  at  pH7.5.  No  spectral  changes  were  observed  in  this  sample  concentration  range.  Spectra 
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were  recorded  at  30°C  on  a  Bruker  DRX500  spectrometer.  Sequential  assignments  were 
obtained  using  3D  !H/15N  NOESY-HSQC,  3D  'H/I5N  HMQC-NOESY-HSQC,  and  3D  *H/15N 
TOCSY-HSQC  spectra.  Side-chain  resonances  were  assigned  using  3D  ’H/,5N  TOCSY-HSQC, 
2D  ‘H-TOCSY,  and  2D  *H  DQF-COSY  spectra.  Aromatic  protons  were  assigned  though  2D  1H- 
TOCSY,  2D  'H-NOESY,  and  2D  !H  DQF-COSY  spectra  recorded  in  2H20.  2D  data  were 
collected  and  analyzed  as  described  (Borden  etal.,  1995a;  Borden  et  al.,  1995b).  For  side-chain 
and  aromatic  assignments,  a  construct  with  no  His  tag  was  used.  NOE-derived  distance 
restraints  were  obtained  from  3D  *H/15N  NOESY-HSQC  spectra  (xm  of  75  and  150  ms)  and  2D 
'H-NOESY  spectra  (xm  of  75  and  150  ms).  0-angle  restraints  were  determined  based  on  the 
Vhn,h<x  coupling  constants  measured  in  a  3D  HNHA  spectrum.  Slowly  exchanging  amide  protons 
were  identified  from  2D  15N-HSQC  spectra  recorded  12  hours  after  the  'H2O  buffer  was 
exchanged  into  a  2H20.  Detailed  descriptions  of  these  experiments  along  with  their  original 
references  have  been  reviewed  elsewhere  (Cavanagh,  1996;  Clore  and  Gronenbom,  1994).  All 
NMR  data  were  processed  using  the  NMRpipe  software  system  (Delaglio  et  al.,  1995)  and 
analyzed  with  the  program  NMRview  (Johnson,  1994). 

Structure  calculations 

Structures  were  calculated  with  a  distance  geometry/simulated  annealing  protocol  in  the  X- 
PLOR  v  3.851  (Brunger,  1996).  Initial  structure  calculations  were  performed  without  any 
assumptions  about  zinc  coordination.  Initial  structures  indicated  a  cross-brace  zinc  ligation 
scheme  (see  Results  and  Discussion).  Subsequent  structure  calculations  included  zinc  atoms 
with  additional  distance  and  angle  constraints  to  maintain  the  tetrahedral  bonding  geometry  of 
the  sites  and  appropriate  bond  lengths  as  previously  described  by  (Neuhaus  et  al.,  1992).  Other 
alternative  ligation  schemes  were  tested  but  were  unable  to  satisfy  the  experimentally  derived 
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constraints.  Structures  from  these  calculations  were  used  for  an  automated,  iterative  assignment 
of  remaining  NOEs  using  ARIA  (Nilges,  1995;  Nilges  et  al.,  1997).  In  addition  to  NOE  data, 
calculations  used  a  total  of  6  hydrogen  bonds  and  20  <>-dihedral  restraints.  Quality  of  the  final 
structures  were  assessed  with  PROCHECK-NMR  (Laskowski  et  al,  1996).  Figures  were 
prepared  using  the  program  PREPI. 

Sequence  and  structure  comparisons 

Structural  comparisons  and  RMS  difference  calculations  between  the  PHD,  PML  (lbor)  and 
IEEHV  (lchc)  were  performed  using  the  least  squares  fitting  algorithm  LSQFIT.  The  graphical 
overlays  were  viewed  using  PREPI.  PREPI  and  LSQFIT  were  kindly  provided  by  S.  Islam  and 
M.  Sternberg,  Imperial  Cancer  Research  Fund.  Sequence  alignments  were  done  using  Clustal  W 
1.7. 
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Figure  Legends 

Figure  1.  (A)  The  consensus  sequences  which  define  the  PHD,  RING,  and  LIM  domains.  C 
indicates  cysteine,  H  histidine,  and  X  indicates  any  residue.  Above  the  consensus  sequence  is 
given  the  number  of  each  metal-ligand,  ml.  In  dark  blue  are  the  first  and  second  pair  of 
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sequential  metal  ligands,  while  in  magenta  are  the  third  and  fourth  pair.  (B)  Demonstration  of 
zinc  ligation  patterns  found  in  RING  and  LIM  domains.  The  RING  uses  a  cross-brace  ligation 
scheme  while  the  LIM  uses  a  sequential  ligation  scheme.  Numbers  correlate  to  the  number  of 
the  metal  ligand  as  defined  in  A.  Zinc  atoms  are  represented  by  grey  ovals  and  the  zinc-binding 
sites  are  denoted  by  roman  numerals.  (C)  Below  are  ribbon  diagrams  of  the  three-dimensional 
structures.  Zinc  atoms  are  represented  by  spheres.  Structures  are  from  the  RING  of  PML 
(1BOR)  and  for  LIM  (1 A7I). 

Figure  2.  (A)  Circular  dichroism  spectrum  of  untagged  KAP-1  PHD  prior  to  addition  of  EDTA 
(solid)  and  after  addition  (dashed).  (B  and  C)  'H/1sN-HSQC  of  KAP-1  PHD  produced  in 
minimal  media  not  supplemented  with  zinc  (B)  or  supplemented  with  zinc  (C). 

Figure  3.  The  KAP-1  PHD  structure.  (A)  a-Carbon  overlay  often  KAP-1  PHD  domain 
structures.  The  residues  N-terminal  to  1627  and  C  terminal  to  H670  are  disordered  and  not 
shown.  The  average  position  of  the  two  zinc  atoms  are  represented  by  white  spheres.  The  blue 
portion  of  the  ensemble  represents  the  flexible  hinge  region  described  in  the  text.  Structural 
statistics  are  given  in  Table  I.  (B)  All  atom  view  of  KAP-1  PHD  in  the  same  orientation  as  in  A. 
Main-chain  is  colored  in  gray.  Side-chains  are  colored  as  follows:  hydrophobic  in  yellow,  polar 
(non-charged)  in  cyan,  polar  (acidic)  in  red,  and  polar  (basic)  in  blue.  (C  and  D)  Ribbon 
diagram  of  the  KAP-1  PHD  domain.  The  p-strands  are  shown  as  arrows  and  zinc  ligands  in 
green.  (E)  Sequence  alignment  of  PHD  domains  and  a  schematic  depicting  zinc  ligation. 

Figure  4.  Comparison  of  the  KAP-1  PHD,  PML  RING  and  IEEHV  RING.  (A)  Superposition 
of  KAP-1  PHD  (with  blue  P-strands,  a.a.  627-652)  and  PML  RING  (in  magenta  P-strands,  a.a. 

56  to  76)  from  the  first  metal-ligand  to  the  sixth  metal-ligand.  The  white  spheres  represent  zinc 
atoms  with  the  upper  zinc  atom  being  site  I.  (B)  Superposition  of  zinc-binding  site  I  for  KAP-1 
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(627-632,  647-652),  PML  (56-71,  76-81),  and  IEEHV  (7-12,  28-33).  The  metal  ligands  are 
colored  according  to  protein;  KAP-1  in  blue,  PML  in  magenta  and  IEEHV  in  green.  (C) 
Superposition  of  the  conserved  hydrophobic  core  residues  N-terminal  to  metal-ligand  5  and  C- 
terminal  to  metal-ligand  6  (L76  and  L81  in  PML,  F28  and  133  in  IEEHV,  F647  and  H652  in 
PHD).  The  side-chains  are  colored  as  in  B.  The  core  residues  from  PHD  are  noted  for  clarity. 
The  conserved  tryptophan  within  the  PHD  family  (W664  in  KAP-1)  is  seen  here  inserting 
between  the  other  core  residues,  repositioning  the  core. 

Figure  5.  Amino  acid  sequence  alignment  of  the  PHD,  RING,  and  LIM  families.  Site  I  metal 
ligands  are  colored  in  magenta  and  site  II  metal  ligands  in  blue.  Conserved  hydrophobic  core 
residues  are  colored  in  yellow.  Accession  and  PDB  codes:  PML  (lbor),  IEEHV  (lchc),  RAG1 
(lrmd),  CrplLIMl  &  LIM2  (1B8T),  CRIPrat  (1IML),  KAP-1  (x99644),  TIFlp  (s78219),  Mi-2a 
(x86691),  HuATRX  (u72937),  BARD  (NP_00456),  DNMTL  (AF194032),  BRCA1  (A58881), 
ING1  (AAF37421). 

Figure  6.  Mutations  in  the  PHD  domain  of  KAP-1  significantly  impair  the  intrinsic  repression 
activity  of  KAP-1.  (A)  Schematic  diagram  illustrating  the  reporter  plasmid  (5X-GAL4-UAS-TK- 
Luciferase)  and  effector  plasmid  (GAL4-KAP-l6i9-835  )•  Primary  amino  acid  sequence  of  the 
minimal  KAP-1  finger  is  illustrated.  The  pound  (#)  symbol  represents  conserved  amino  acids 
that  were  mutated.  Dollar  signs  ($)  represent  non-conserved  amino  acids  in  KAP-1  which  were 
mutated  to  match  the  corresponding  amino  acids  in  TIFloc/TIFly.  Each  mutation  was  made  in 
the  context  of  a  GAL4-KAP-1  (619-835)  expression  construct.  (B)  Mutations  in  the  KAP-1 
PHD  domain  disrupt  its  role  in  transcriptional  repression.  All  experiments  were  done  in 
NIH/3T3  cells  with  5pg  of  the  indicated  GAL4-fusion  protein  and  lpg  of  a  5x-GAL4(UAS)-TK- 
luciferase  reporter.  A  black  bar  represents  wild-type,  grey  bars  represent  mutations  of  conserved 
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amino  acid  residues,  and  white  bars  represent  substitutions  at  non-conserved  amino  acids.  (C) 
Stable  expression  of  each  protein  was  determined  via  transfection  into  COS1  cells  followed  by 
immunoprecipitation  of  35S-met  labeled  whole  cell  extracts  with  anti-GAL4  (DBD)  antiserum  (1 

Pg)- 
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Table  I.  Structural  statistics 

Restraints  for  structure  calculation  (residues 

627-670) 

Total  Restraints 

385 

Total  NOE  restraints 

317 

Intra-residue 

78 

Sequential 

118 

Medium  range  (2  <  |i-y|  <  4) 

26 

Long-range  (|/-j|  >  5) 

95 

Dihedral  angle  <]) 

20 

H-bond  restraints 

12 

Zinc  distance  restraints 

20 

Zinc  angle  restraints 

16 

Final  energies  (kcal/mol) 

E total 

37.1  ±  1.1 

Enoe 

0.30  ±0.21 

Ecdih 

0.05  ±  0.04 

Coordinate  precision  (627-653,  664-670) 

0.73  ±0.16  A 

R.m.s.d.  of  backbone  atoms 

R.m.s.d.  of  all  heavy  atoms 

1.48  ±0.18  A 

Procheck  analysis  (627-670) 

Most  favoured  regions 

36.0% 

Additional  allowed 

45.9% 

Generously  allowed 

14.9% 

Disallowed  region 

2.4% 

Table  II.  PHD  mutations  and  structural  consequences _ 

Mutations  Location  in  KAP-1  PHD  structure  Transcriptional  Structural  consequences 

_  repressiona  (%) _ 

CC628,631AA  Zn2+-ligands  of  site  I  17  loss  of  metal  binding,  incorrect  folding 

G635A  Surface,  extended  region  before  Pi  87  no  obvious  structural  effect 
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